Skip to content

Latest commit

 

History

History
7 lines (4 loc) · 803 Bytes

README.md

File metadata and controls

7 lines (4 loc) · 803 Bytes

Investigating a Dataset- Baseball's Elite Hitters

Analysis of the top players in a given sport can help uncover useful insights for both the individual and team. In this analysis, I’ll primarily look at characteristics of “elite hitters.” Later in the analysis of elite hitters, I will make an investigation into the statistical impact of performance-enhancing drugs on the overall population of elite hitters. Numpy and Pandas modules were the primary libraries used to analyze the dataset.

About the data-

This is a data set containing complete batting and pitching statistics from 1871 to 2014, plus fielding statistics, standings, team stats, managerial records, post-season data, and more. A link to the dataset can be found here: http://www.seanlahman.com/baseball-archive/statistics/