We're pleased to announce a new release of the Wordbank database, bringing us to more than 100,000 CDI administrations! This release includes data from Japanese, Estonian, Saudi Arabic, and Catalan, as well as new data from Korean, Finnish, French, and French-English bilinguals.
Thursday, February 8, 2024
Tuesday, March 7, 2023
A meta-analysis of outcomes for late talkers
https://www.crd.york.ac.uk/PROSPERO, registration CRD42023394687.
Emma Hayiou-Thomas, emma.hayiou-thomas@york.ac.uk
Philip Dale, dalep@unm.edu
Sunday, January 29, 2023
Educational materials
Thursday, August 11, 2022
Wordbank updates!
We are very pleased to announce a major update to Wordbank, including some significant changes to the database structure. We are also adding more than 10 new languages and data from many thousands of children.
We now include data from multilingual children and children with diagnosed developmental disorders (as well as functionality for identifying these children via the shiny apps and the wordbankr API).
The prior version of the Wordbank database will remain up and available for queries via the wordbankr 0.3.0 API (or earlier) for a period of at least 6 months, but if you upgrade wordbankr you will begin accessing the new Wordbank data.
Wordbank data will also be versioned going forward so that older snapshots of the database will be available via S3 snapshots (see Documentation page).
Wednesday, April 7, 2021
Wordbank Book!
We are very pleased to announce publication of the Wordbank Book, "Variability and Consistency in Early Language Learning," now available from MIT Press (2021). The book brings together many different ways of looking at data in the Wordbank database, in service of characterizing how children vary as well as shared patterns of learning. The book is also available free online at http://wordbank-book.stanford.edu, and all of the code necessary to generate it from the Wordbank data can be found at http://github.com/langcog/wordbank-book.
Wednesday, March 7, 2018
More languages (and some naming changes)
First, the arrival of several more languages and datasets, including French (European), more Korean data, as well as more Hebrew data and Spanish (European) in the works.
Second, we have a new licensing standard such that some datasets can be licensed Creative Commons for Non-Commercial use. These datasets are marked on the contributors page.
Finally, because of the new data, we have some new naming conventions for languages. "English" is now "English (American)"; this convention will generally be followed as "Language (Country/Region)." These are breaking changes unfortunately, we apologize for the inconvenience and are working on past database images available for purposes of reproducibility.