View Content #12170
Contentid | 12170 |
---|---|
Content Type | 1 |
Title | Google Launches Books Ngram Viewer, an Amazing Corpus Tool |
Body | From http://www.nytimes.com/2010/12/17/books/17words.html?_r=1&hp In 500 Billion Words, New Window on Culture By PATRICIA COHEN December 16, 2010 With little fanfare, Google has made a mammoth database culled from nearly 5.2 million digitized books available to the public for free downloads and online searches, opening a new landscape of possibilities for research and education in the humanities. The digital storehouse, which comprises words and short phrases as well as a year-by-year count of how often they appear, represents the first time a data set of this magnitude and searching tools are at the disposal of Ph.D.’s, middle school students and anyone else who likes to spend time in front of a small screen. It consists of the 500 billion words contained in books published between 1500 and 2008 in English, French, Spanish, German, Chinese and Russian. Read the full New York Times article at http://www.nytimes.com/2010/12/17/books/17words.html?_r=1&hp --- The Science article is available at http://www.sciencemag.org/content/early/2010/12/15/science.1199644 --- Read a summary in the Scientific American at http://www.scientificamerican.com/article.cfm?id=google-books-culture --- Larry Ferlazzo has put together a list of The Best Posts To Help Understand Google’s New “Books Ngram Viewer” at http://larryferlazzo.edublogs.org/2010/12/17/the-best-posts-to-help-understand-googles-new-books-ngram-viewer --- How can you use this in your classroom? Get some ideas in this blog post: http://peterpappas.blogs.com/copy_paste/2010/12/how-to-quantify-culture-google-ngram-viewer-explore-500-billion-published-words.html --- The tool itself is available at http://ngrams.googlelabs.com . It is easy to use; enjoy your exploring! |
Source | Various |
Inputdate | 2010-12-19 01:45:55 |
Lastmodifieddate | 2010-12-19 01:45:55 |
Expdate | Not set |
Publishdate | 2010-12-20 00:00:00 |
Displaydate | Not set |
Active | 1 |
Emailed | 1 |
Isarchived | 1 |