Citing LingPipe
If you want to cite the LingPipe software, we suggest following the Chicago Manual of Style's guideline 17.356 for citing web sites in scientific articles. For the bibliography, they suggest the following form:
- Alias-i. 2008. LingPipe 3.9.3. http://alias-i.com/lingpipe (accessed October 1, 2008)
For inline citations, that would be:
- (Alias-i 2008)
Papers Mentioning LingPipe
We list papers that we wrote, as well as papers written by others.
Papers from Alias-i
We've spent much more time writing code, javadoc and tutorials than papers, but we have produced a few to go along with workshops or bakeoffs.
- Carpenter, Bob. 2007. LingPipe for 99.99% Recall of Gene Mentions. Proceedings of the 2nd BioCreative workshop. Valencia, Spain. [pdf]
- Carpenter, Bob. 2006. Character language models for Chinese word segmentation and named entity recogntion. Proceedings of the 5th ACL Chinese Special Interest Group (SIGHan). Sydney, Austrlia. [pdf]
- Carpenter, Bob. 2005. Scaling High-Order Character Language Models to Gigabytes. In Proceedings of the Association for Computational Linguistics Workshop on Software. Ann Arbor. [pdf]
- Carpenter, Bob. 2004. Phrasal Queries with LingPipe and Lucene. In Proceedings of the 13th Meeting of the Text Retrieval Conference (TREC). Gaithersburg, Maryland. [pdf]
- Carpenter, Bob. 2004. Orthographic variation with Lucene. In O. Gospodnetic and E. Hatcher, Lucene in Action. Manning Press.
Third-Party Papers
If we missed your paper and you'd like to see it in this list,
please drop us a line at lingpipe@alias-i.com
.
We're speding some time every release going through Google Scholar, but
we've only considered 100 or so of the several hundred results
presented (450 as of this release).
- Beneti, Aspasia, Woiyl Hammoumi, Eric Hielscher, Martin Müller, and David Persons. 2006. Automatic generation of fine-grained named entity classifications. Technical report, University of Amsterdam. [pdf]
- Bey, Youcef, Christian Boitet, and Kyo Kageura. 2006. The TRANSBey Prototype: An Online Collaborative Wiki-Based CAT Environment for Volunteer Translators. In Proceedings of LREC. [pdf]
- Bischoff, Kerstin, Thomas Mandl and Christa Womser-Hacker. 2007. Blind Relevance Feedback and Named Entity Based Query Expansion for Geographic Retrieval at GeoCLEF 2006. In Evaluation of Multilingual and Multi-modal Information Retrieval, CLEF 2007. Springer. publisher link]
- Bradford, R. B. 2006. Relationship Discovery in Large Text Collections Using Latent Semantic Indexing. In Proceedings of SDM 06. [pdf].
- Buscaldi, Davide and Paolo Rosso. 2007. On the Relative Importance of Toponyms in GeoCLEF. In Proceedings of CLEF 2007. [pdf]
- Chambers, Nate and Shan Wang. 2006. Temporal Ordering of Event Descriptions. CS 229 Class Project. Stanford University. [pdf]
- Chen, Jiangping, He Ge, Y. Wu, and S. Jiang. 2004. UNT at TREC 2004: Question Answering Combining Multiple Evidences. Text Retrieval Conference (TREC). [pdf]
- Chen, Jiangping, Ping Yu and He Ge. 2005. UNT 2005 TREC QA Participation: Using Lemur as IR Search Engine. In Proceedings of TREC 2005. [pdf]
- Clarke, James and Mirella Lapata. 2007. Modelling Compression with Discourse Constraints. In Proceedings of EMNLP/CoNLL 2007. [pdf]
- Corbett, Peter, Colin Batchelor, and Simon Teufel. 2007. Annotation of chemical named entities. In Proceedings of BioNLP 2007, 57-64, Prague. [pdf]
- D'Avanzo, Ernesto and Bernardo Magnini. 2005. A Keyphrase-Based Approach to Summarization: the LAKE System at DUC-2005. In Document Understanding Conference. [pdf]
- Dale, Robert and Pawel Mazur. 2007. The Semantics of Temporal Expressions. In Proceedings of the Twentieth Australian Joint Conference on Artificial Intelligence. 435-444. Gold Coast, Queensland, Australia. [pdf]
- Damianos, Laurie, Jay Ponte, Steve Wohlever, Florence Reeder, David Day, George Wilson, and Lynette Hirschman. 2002. MiTAP for Bio-Security: A Case Study. AI Magazine 23(4):13-29. [pdf]
- Denecke, K. 2008. Using SentiWordNet for multilingual sentiment analysis. In IEEE 24th International Conference on Data Engineering Workshop (ICDEW). 507-512. [publisher site]
- Deschacht, K., M. F. Moens, and W. Robeyns. 2007. Crossmedia entity recognition in nearly parallel visual and textual documents. 8th RIAO Conference on Large-Scale Semantic Access.
- Duong, Deborah, Ben Goertzel, Jim Venuto, Ryan Richardson, Shawn Bohner, and Edward Fox. 2006. Support Vector Machines to Weight Voters in a Voting System of Entity Extractors. In Proceedings of the International Joint Conference on Neural Networks (IJCNN). [publishers page]
- Favre, Benoît, B Favre, Frédéric Béchet, and Pascal Nocéra. 2005. Robust Named Entity extraction from large spoken archives. In Proceedings of HLT/EMNLP. 491-498. Vancouver. [pdf]
- Gasperin, Caroline. 2006. Semi-supervised anaphora resolution in biomedical texts. In Proceedings of BioNLP Workshop on Linking Natural Language Processing and Biology at HLT-NAACL. 96-103. New York City. [pdf]
- Geoffrey, Andogah. 2007. GIR [Geographic Information Retrieval] Experimentation. In Evaluation of Multilingual and Multi-modal Information Retrieval, CLEF 2006. Springer. [publisher link].
- He, Ying and Mehmet Kayaalp. 2006. A Comparison of 13 Tokenizers on MEDLINE. Lister Hill National Center for Biomedical Communications Technical Report LHNCBC-TR-2006-003. [pdf]
- Iftene, Adrian and Alexandra Balahur-Dobrescu. 2008. Answer Validation on English and Romanian Languages. In Proceedings of CLEF 2008. [pdf]
- Iftene, Adrian and Alexandra Balahur-Dobrescu. 2007. Hypothesis Transformation and Semantic Variability Rules Used in Recognizing Textual Entailment. In Proceedings of the Association for Computational Linguistics (ACL). [pdf]
- Iftene, Adrian and Alexandra Balahur-Dobrescu. 2007. UAIC Participation at AVE 2007. In CLEF 2007, LNCS 5152, 395-403. Springer.
- Kabiljo, Renata and Adrian J. Shepherd. 2008. Protein Name Tagging in the Immunological Domain. In Proceedings of SMBM 2008. [pdf]
- Kaljurand, Kaarel, Fabio Rinaldi, James Dowdall, and Michael Hess. 2004. Exploiting Language Resources for Semantic Web Annotations. In Proceedings of LREC. [CiteSeer]
- Kashani, Mehdi M. and Fred Popowich. 2006. Pronoun Generation for Text Summarization and Question Answering. In Proceedings of 5th Slovenian and 1st international Language Technologies Conference. [pdf].
- Leaman, Robert and Graciela Gonzalez. 2008. Banner: an executable survey of advances in biomedical named entity recognition. In Proceedings of the Pacific Symposium on Biocomputing (PSB) 13:652-663. [pdf]
- Li, Yi, Alistair Moffat, Nicola Stokes, and Lawrence Cavedon. 2006. Exploring Probabilistic Toponym Resolution for Geographical Information Retrieval. In 3rd Workshop on Geographic Information Retrieval (GIR). [pdf]
- Mason, Joshua, Kathryn Watkins, Jason Eisner, and Adam Stubblefield. 2006. A natural language approach to automated cryptanalysis of two-time pads. In Proceedings of the 13th ACM Conference on Computer and Communications Security. [pdf]
- Mazur, Paweł and Robert Dale. 2007. The DANTE Temporal Expression Tagger. In Proceedings of the 3rd Language and Technology Conference. Poznan, Poland. [pdf]
- Mazur, Paweł and Robert Dale. 2007. A Rule Based Approach to Temporal Expression Tagging. In Proceedings of the International Multiconference on Computer Science and Information Technology (IMCSIT) 2nd International Symposium: Advances in Artificial Intelligence and Applications. Wisla, Poland. [pdf]
- Melli, Gabor, Yang Wang, Yudong Liu, Mehdi M. Kashani, Zhongmin Shi, Baohua Gu, Anoop Sarkar, and Fred Popowich. 2005. Description of SQUASH, the SFU Question Answering Summary Handler for the DUC-2005 Summarization Task. In Proceedings of the Document Understanding Conference (DUC).
- Molla, Diego and Menno Van Zaanen. 2005. Learning of graph rules for question answering. In Proceedings of ALTW. [pdf]
- Neumann, Günter and Bogdan Sacaleanu. 2005. Experiments on Robust NL Question Interpretation and Multi-layered Document Annotation for a CrossLanguage Question/Answering System. In Multilingual Information Access for Text, Speech and Images, CLEF 2004. Springer. [publisher link].
- Ofoghi, Bahadorreza, John Yearwood and Liping Ma. 2007. The Impact of Semantic Class Identification and Semantic Role Labeling on Natural Language Answer Extraction In Advances in Information Retrieval (ECIR) LNCS 4956. 430--437. Springer. [publisher link]
- Klinger, Roman, Corinna Kolárik, Fluck, Juliane, Hofmann-Apitius, Martin, and Friedrich, Christoph M. 2008. Detection of IUPAC and IUPAC-like chemical names. Bioinformatics 24(13):i268-i276.
- Perea-Ortega, José M., Miguel Angel García Cumbreras, Manuel García-Vega, and Luis Alfonso Ureña López. 2008. SINAI-GIR System: University of Jaén at GeoCLEF 2008. In Proceedings fo GeoCLEF 2008. [pdf]
- Schilder, Frank, Andrew McCulloh, Bridget Thomson McInnes, and Alex Zhou. 2005. TLR at DUC: Tree similarity. Proceedings of the Document Understanding Conference (DUC). [pdf]
- Strötgen, Robert, Thomas Mandl, and René Schneider. 2006. A Fast Forward Approach to Cross-Lingual Question Answering for English and German. In Accessing Multilingual Information Repositories, CLEF 2005. Springer. [publisher link].
- Stokes, Y.L.N., L. Cavedon, and A. Moffat. 2006. NICTA I2D2 Group at GeoCLEF 2006. Proceedings of CLEF. [pdf]
- Sureka, Ashish, Sudripto De, and Kishore Varma. 2008. Mining Automotive Warranty Claims Data for Effective Root Cause Analysis. In Database Systems for Advanced Applications, LNCS 4947. 621-626. Springer. [publisher link]
- Tratz, Stephen, Antonio Sanfilippo, Michelle Gregory, Alan Chappell, Christian Posse and Paul Whitney. 2007. PNNL: A Supervised Maximum Entropy Approach to Word Sense Disambiguation. In Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval-2007). 264--267. Prague. [pdf]
- Vlachos, Andreas. 2006. Active annotation. In Adaptive Text Extraction and Mining (ATEM). [pdf]
- Vlachos, Andreas and Caroline Gasperin. 2006. Bootstrapping and Evaluating Named Entity Recognition in the Biomedical Domain. In Proceedings of the BioNLP Workshop at HLT-NAACL. [pdf].
- Vlachos, Andreas, Caroline Gasperin, Ian Lewin, and Ted Briscoe. 2006. Bootstrapping the Recognition and Anaphoric Linking of Named Entities in Drosophila Articles. In Proceedings of the Pacific Symposium on Biocomputing 11:100-111. [pdf]
- Vlachos, Andreas. 2007. Evaluating and combining biomedical named entity recognition systems. In Proceedings of ACL Workshop. [pdf]
- Wang, Hudong, Shannon Bradshaw and Marc Light. 2005. Automatic highlighting of bioscience literature. In Proceedings of BioLink.
- Zhu, Weizhong, Chaomei Chen, and Robert B. Allen. 2006. Visualizing the Evolution of Social Networks. Poster presented at IST Research Day 2006. Drexel University. [pdf]
Patent Applications
Yes, we've even been mentioned in 3rd-party patent applications! One doesn't need to own all the intellectual property mentioned in a patent to get a patent.
- Frankie E. D. Patman and Charles Kinston Williams. 2007. Filtering extracted personal names. U.S. Patent Application 20070005578A1. [Google Patents]
Courses using LingPipe
I know there are more out there, but these are the only syllabi I could find online (search: <syllabus lingpipe site:.edu>). Let us know if you are using us in your class, especially if you'd like help.
- William Lewis. 2008. Ling 570: Shallow Processing Techniques for Natural Language Processing. University of Washington. [syllabus]