19

I'm interested in being able to identify a name like "Nick" as a short form of a name like "Nicholas" in a data set.

Is there any kind of open or somewhat standardized mapping of names and nicknames available? I am fine if it's limited to Western or American English names, but international data is even better.

shadowtalker
  • 331
  • 1
  • 2
  • 7
  • how does this bounty work? i feel like my answer is good enough, but i'm sure it could be better. are you asking for more answers? – albert Apr 11 '18 at 20:29
  • 2
    @albert, no, I'm not asking for more answers, but you obviously can edit existing answer, if you want. One just can't award an answer in first 24 hours after bounty assignment, even if the bounty reason is "award existing answer". Of course, if somebody will post another good answer today, I'll start another bounty. Current bounty is a share of this one. – Stanislav Kralin Apr 12 '18 at 08:42
  • ah, i guess i'm just confused at how bounties work. – albert Apr 12 '18 at 15:59

3 Answers3

23

Nicknames and Diminutive Names Lookup (CSV)
CSV, Database of Common Nicknames and its GitHub Repository: Common Nickname CSV (Database)
Name to Nickname CSV and Nickname to Name CSV

albert
  • 11,885
  • 4
  • 30
  • 57
  • 3
    There's also this list that was extracted from US Census data: https://web.archive.org/web/20181022154748/https://deron.meranda.us/data/nicknames.txt – glenviewjeff Apr 27 '19 at 22:13
  • 1
    @glenviewjeff I'm wary to trust that list. It makes some weird choices like mapping "Sam" to Samuel but not Samantha, "Izzy" to Isadore but not Isabelle, etc. – user3932000 Jul 31 '19 at 17:27
3

Wikidata has a nickname property https://www.wikidata.org/wiki/Property:P1449

which can be used with given names https://www.wikidata.org/wiki/Q202444 or https://www.wikidata.org/wiki/Q82799

which you can query with SPARQL https://query.wikidata.org

This approach may give you a more "real-life" dataset, and statistics about how many nicknames map back to given names, etc.

philshem
  • 17,647
  • 7
  • 68
  • 170
  • This is a great idea, but when I tried some example queries at https://w.wiki/4qa2, a lot of the nicknames I got were weird. Like "The Little Assassin" (https://www.wikidata.org/wiki/Q233339). Maybe with some filtering though this could be useful? – Nick Crews Feb 15 '22 at 22:17