An Examination of Statistical Disclosure Issues Related to Publication of Aggregate Statistics in the Presence of a Known Subset of the Dataset Using Baseball Hall of Fame Ballots

Gregory J Matthews, Pétala Gardênia da Silva Estrela Tuy, Robert K. Arthur

Research output: Contribution to journalArticlepeer-review

Abstract

Each year the members of the Baseball Writers Association of America (BBWAA) vote for eligible former players to be inducted into the Baseball Hall of Fame. The BBWAA tabulates and releases vote totals, but individual ballots remain private. However, many voters forgo their ballot privacy to publish their ballots through various media channels. These publicly available ballots can be aggregated to create a subset of the true ballots. Using these released ballots and the totals released by the BBWAA, this research assesses what can be learned about the group of voters who chose to not disclose their ballot. Attributes of the known and unknown ballot groups are studied by looking at differences in voting preference for individual players as well as voting differences between classes of voters that are defined using latent class analysis (LCA).

Original languageAmerican English
JournalMathematics and Statistics: Faculty Publications and Other Works
Volume13
Issue number1
DOIs
StatePublished - Mar 1 2017

Keywords

  • Baseball
  • Latent Class Analysis
  • Multiple Imputation
  • Statistical Disclosure

Disciplines

  • Mathematics

Cite this