The Cambridge International Corpus is a collection of over 2 billion words of real spoken and written English. The texts are stored in a database that can be searched to see how English is used. From Wikipedia
A year-long corpus study guided editors to add only new terms with broad, sustained usage, from social-media slang to policy jargon