Buckeye Corpus

Register | Logout"; }else{echo "Login";} ?>

Buckeye Corpus

Ohio State University logo

Updates

Below is a description of changes since the initial release and patches with which to update prior versions. The second release of the corpus, with all 40 speakers, is now available. To upgrade to version 2.0, simply download the files for 20 newly-added speakers.

Verison
Release Date
Size
Update 2.0 5-14-07 652 MB
Release 2.0 2-28-07 4.1 GB
Patch 1.1.0 4-28-06 2.0 MB

Change Log



Update 2.0

Corrections to errors in the .words files
Below are the zip files affected:
s0204b.zip s1001b.zip s1103b.zip s1503a.zip s2501b.zip s3201a.zip s0502a.zip s1002a.zip s1104b.zip s1801b.zip s2701a.zip s3303b.zip s0602b.zip s1002b.zip s1302b.zip s1802a.zip s2801a.zip s3402a.zip s0701a.zip s1003a.zip s1303a.zip s1802b.zip s2802a.zip s3602a.zip s0702b.zip s1003b.zip s1401b.zip s2101b.zip s2902b.zip s3703a.zip s0801a.zip s1101a.zip s1501a.zip s2301a.zip s2903b.zip s3801a.zip s0801b.zip s1101b.zip s1501b.zip s2301b.zip s3001b.zip s3902a.zip s0903b.zip s1102b.zip s1502a.zip s2302b.zip s3002a.zip s4002b.zip s1001a.zip s1103a.zip s1502b.zip s2402a.zip s3102a.zip


Release 2.0

Changes From Release 1.1.0:
Added all files (.wav, .phones, .words) for the remaining 20 speakers, who are listed below.
s01, s05, s06, s07, s08, s09, s18, s19, s23, s27
s28, s29, s30, s31, s34, s36, s37, s38, s39, s40

Release 1.1.0

Changes From Release 1.0:
In the *.phones files, the symbol ‘+1’ appeared at the end of some phones in the following directories: s0201b, s0202a, s0202b, s0203b, s0204a, s0401b, s0402a, s0402b, s0403a, s0403b, s0404a, s1001a, s1001b, s1002a, s1002b, s1003a, s1003b, s1004a, s1301a, s1301b, s1302b, s1303a, s1303b, s1304a, s1401a, s1401b, s1601b, s1602b, s1603a, s1604a, s1701a, s1701b, s1702a, s1702b, s1703a, s1703b, s2001b, s2002a, s2002b, s2003a, s2003b, s2004a, s2601a, s2601b, s2602a, s2602b, s2603a, s2603b, s3201a, s3202a, s3502b
All the instances were removed, and the .phones, .words and text transcripts were updated.
Directories s3503a and s3502c were included as subdirectories of s3504a, and s3503a.words and s3503b.words had no content.
s3502c was renamed to s3503a
s3503a was renamed to s3503b
All directories for speaker s35 now contain *. words files, and the newly renamed directories can now be seen for speaker s35.
Several .words files had a few lines in which words appeared with no phonetic transcription. The affected directories were:
s1003b, s1301b, s1302a, s1302b, s1303a, s1303b, s1304a, s3301a
The .words files were re-written to ensure that pronunciations existed for the duplicated words. Transcript files were also re-written for each of the affected directories.
An illegal phonetic symbol appeared in the .phones file for speaker directory s2403a.
The symbol was replaced with the correct one.


buckeye leaf ©2005 Department of Psychology