OpenEdition Lab

BILBO: Automatic reference labeling

BILBO is open source software for the automatic annotation of bibliographic references. It segments and tags input strings. It is principally based on Conditional Random Fields (CRFs), a machine learning technique to...

Indexing the INEX SBS Corpus in Terrier

In this post, we demonstrate how to convert the INEX Social Book Search corpus from XML into the TREC collection format; this format is valid input for Terrier (compatible with the TRECCollection parser) where...
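
As a rough illustration of the kind of conversion described in that post, here is a minimal Python sketch that turns one XML book record into a TREC-style document (a DOC element with a DOCNO identifier and a text body). The function name `book_to_trec` and the element names `isbn`, `title` and `summary` are assumptions made for this example, not taken from the post; adjust them to the fields actually present in the corpus.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape


def book_to_trec(xml_text, id_tag="isbn", text_tags=("title", "summary")):
    """Convert one XML book record into a TREC-style <DOC> block.

    id_tag and text_tags are hypothetical field names chosen for
    illustration; they are not confirmed by the original post.
    """
    root = ET.fromstring(xml_text)

    # The document identifier goes into <DOCNO>.
    doc_id = (root.findtext(f".//{id_tag}") or "").strip()
    if not doc_id:
        raise ValueError(f"record has no <{id_tag}> element to use as DOCNO")

    parts = []
    for tag in text_tags:
        for element in root.iter(tag):
            # Flatten nested markup and collapse runs of whitespace.
            text = " ".join("".join(element.itertext()).split())
            if text:
                # Escape so stray angle brackets are not mistaken for tags.
                parts.append(escape(text))

    body = "\n".join(parts)
    return f"<DOC>\n<DOCNO>{doc_id}</DOCNO>\n<TEXT>\n{body}\n</TEXT>\n</DOC>\n"
```

Calling `book_to_trec` on each record and appending the results to a single file gives one collection file in this layout. Which tags the indexer treats as document boundaries and identifiers is a configuration matter on the Terrier side, so the tag names here may need to match that configuration.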