Comparison and benchmark of name-to-gender inference services.

PeerJ Comput Sci

University of Applied Sciences, Berlin, Germany.

Published: July 2018

The increased interest in analyzing and explaining gender inequalities in tech, media, and academia highlights the need for accurate methods of inferring a person's gender from their name. Several services offer such inference, providing access to large databases of names that are often enriched with information from social media profiles, culture-specific rules, and insights from sociolinguistics. We compare and benchmark five name-to-gender inference services by applying them to a test data set of 7,076 manually labeled names. The compiled names are analyzed and characterized according to their geographical and cultural origin. We define a series of performance metrics to quantify various types of classification errors, together with a parameter tuning procedure that searches for optimal values of the services' free parameters. Finally, we benchmark all services under study in several scenarios, each of which optimizes a particular metric.
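The abstract does not spell out the performance metrics, so the following is only a minimal sketch of how error rates of this kind could be computed, assuming each service returns "f", "m", or "u" (unknown) for a name whose manual label is "f" or "m". The function name `error_rates`, the label codes, and the three metric names are hypothetical and are not taken from the paper.

```python
from collections import Counter


def error_rates(pairs):
    """Compute illustrative error rates from (true, predicted) label pairs.

    Assumption: true labels are "f" or "m"; predictions are "f", "m",
    or "u" when the service declines to assign a gender.
    """
    counts = Counter(pairs)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no labeled names supplied")

    # Names assigned the wrong gender.
    wrong = counts[("f", "m")] + counts[("m", "f")]
    # Names the service left unclassified.
    unknown = counts[("f", "u")] + counts[("m", "u")]
    classified = total - unknown

    return {
        # Treats both wrong and unclassified names as errors.
        "error_with_unknown": (wrong + unknown) / total,
        # Ignores unclassified names entirely.
        "error_without_unknown": wrong / classified if classified else 0.0,
        # Share of names left unclassified.
        "unknown_rate": unknown / total,
    }


sample = [("f", "f")] * 6 + [("m", "m")] * 2 + [("f", "m"), ("m", "u")]
rates = error_rates(sample)
```

Separating the three rates makes the trade-off visible: a service can lower its misclassification rate among classified names simply by leaving more names unclassified, which is why a benchmark would need to state which metric a given scenario optimizes.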

PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924484
DOI: http://dx.doi.org/10.7717/peerj-cs.156
