Description |
-
Due to the rise of very large, heterogeneous collections, increasingly sophisticated multilingual services, and expanding high performance computing infrastructure, we are now in a position to begin studying 4000 years of linguistic data from around the world, tracing change within languages, the interaction of languages, the evolution and circulation of ideas, and the patterns of human society. ... read moreLanguage has been an impenetrable barrier we can reach any point on the globe in a matter of hours but the amount of time required to master a new language remains unchanged. We can, however, now begin to work with far more languages than we could ever study, much less master. We are now in a position to pursue broader questions and to pursue these with greater rigor than would have been possible in print. A great deal of work remains to be done, however, for very large collections are not scientific corpora and need extensive processing, and many written sources do not yet lend themselves to optical character recognition. Simply scaling up existing systems to analyze millions of books poses software engineering challenges. Perhaps most important of all, we need to train a new generation of researchers who can bridge the intellectual gaps between the relevant computational methods and new research for social, behavioral and economic sciences.read less
|
This object is in collection