Title: Building a Dynamic Lexicon from a Digital Library
Date: 2008
Creator: Bamman, David
Creator: Crane, Gregory
Format: application/pdf
Organizations: Perseus Project
Topics: Digital libraries
Topics: Latin language
Topics: Latin language-data processing
Topics: Syntactic parsing
Topics: Natural language processing
Topics: Lexicography

Citable URL: http://hdl.handle.net/10427/42686
Author: Bamman, David; Crane, Gregory
Citation: Bamman, David, and Gregory Crane. "Building a Dynamic Lexicon from a Digital Library." In Proceedings of the 2008 Joint Conference on Digital Libraries, Pittsburgh, Pennsylvania, June 16-20, 2008, preprint. New York: Association for Computing Machinery, 2008. Available from Tufts Digital Library, Digital Collections and Archives, Medford, MA. http://hdl.handle.net/10427/42686
Rights: http://www.acm.org/publications/policies/copyright_policy

Abstract: We describe in detail our work toward creating a dynamic lexicon from the texts in a large digital library. By leveraging a small structured knowledge source (a 30,457-word treebank), we are able to extract selectional preferences for words from a 3.5-million-word Latin corpus. This is promising news for low-resource languages and for digital collections seeking to turn a small human investment into a much larger gain. The library architecture in which this work is developed allows us to query customized subcorpora to report on lexical usage by author, genre, or era, and to continually update the lexicon as new texts are added to the collection.
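
Illustration: the abstract does not spell out how the selectional preferences are computed, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes the corpus has already been parsed into dependency triples that carry author metadata, and it ranks the dependents of a word by plain relative frequency. The `Dep` record, the `preferences` function, the relation labels, and the toy data are all hypothetical and are not taken from the paper.

```python
from collections import Counter
from typing import Iterable, List, NamedTuple, Optional, Tuple


class Dep(NamedTuple):
    """One parsed dependency (hypothetical record layout)."""
    head: str       # lemma of the governing word
    relation: str   # syntactic relation label, e.g. "OBJ"
    dependent: str  # lemma of the dependent word
    author: str     # metadata used to select a subcorpus


def preferences(
    deps: Iterable[Dep],
    head: str,
    relation: str,
    author: Optional[str] = None,
) -> List[Tuple[str, float]]:
    """Rank the dependents of `head` in `relation` by relative frequency.

    Passing `author` restricts the count to that author's subcorpus.
    Relative frequency is a stand-in scoring choice for this sketch.
    """
    counts = Counter(
        d.dependent
        for d in deps
        if d.head == head
        and d.relation == relation
        and (author is None or d.author == author)
    )
    total = sum(counts.values())
    # An empty Counter yields an empty list, so no division by zero occurs.
    return [(lemma, n / total) for lemma, n in counts.most_common()]


# Toy data invented for this sketch; not drawn from the treebank or corpus.
DEPS = [
    Dep("gero", "OBJ", "bellum", "Caesar"),
    Dep("gero", "OBJ", "bellum", "Caesar"),
    Dep("gero", "OBJ", "res", "Cicero"),
    Dep("gero", "SBJ", "consul", "Cicero"),
]

if __name__ == "__main__":
    print(preferences(DEPS, "gero", "OBJ"))
    print(preferences(DEPS, "gero", "OBJ", author="Cicero"))
```

Because the counts are recomputed from whatever triples are passed in, adding newly parsed texts or filtering by another metadata field (genre or era, if such fields were recorded) changes the report without any other modification.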