Datasets
Redistributable public datasets for language processing, names, lexicons, validation, and authorized security testing.
18 results
Redistributable public datasets for language processing, names, lexicons, validation, and authorized security testing.
18 results
18 listings
18 listings
Cost
Free
A CSV package for Scotland baby names, including the all-years CSV and the 2025 workbook converted to CSV.
Cost
Free
A CSV package for national, regional, department, and first-name list tables from INSEE.
Cost
Free
A CSV package converted from the 2024 ONS boys and girls baby-name workbooks, preserving every worksheet.
Cost
Free
A CSV package preserving official Ireland and Northern Ireland baby-name tables.
Cost
Free
A CSV package preserving national and provincial Canadian baby-name tables.
Cost
Free
A normalized CSV package for historical Census surname data plus 1990 first-name and last-name frequency files.
Cost
Free
A CSV package converted from the Census 2020 first-name workbooks, preserving each source worksheet as a CSV table.
Cost
Free
A normalized CSV package for territory baby-name counts by territory, year, sex, name, and count.
Cost
Free
A normalized CSV package for baby-name counts by state, year, sex, name, and count.
Cost
Free
A normalized CSV package for national baby-name counts by year, sex, name, and count.