| Type: | Package |
| Title: | Professional Sports Draft Data |
| Version: | 1.0.3 |
| Author: | Benjamin Ginsburg [aut, cre] |
| Maintainer: | Benjamin Ginsburg <benjamin.ginsburg@du.edu> |
| Description: | We provide comprehensive draft data for major professional sports leagues, including the National Football League (NFL), National Basketball Association (NBA), and National Hockey League (NHL). It offers access to both historical and current draft data, allowing for detailed analysis and research on player biases and player performance. The package is useful for sports fans and researchers interested in identifying biases and trends within scouting reports. Created by web scraping data from leading websites that cover professional sports player scouting reports, the package allows users to filter and summarize data for analytical purposes. For further details on the methods used, please refer to Wickham (2022) "rvest: Easily Harvest (Scrape) Web Pages" https://CRAN.R-project.org/package=rvest and Harrison (2023) "RSelenium: R Bindings for Selenium WebDriver" https://CRAN.R-project.org/package=RSelenium. |
| Encoding: | UTF-8 |
| LazyData: | true |
| RoxygenNote: | 7.3.2 |
| URL: | https://github.com/Ginsburg1/ProSportsDraftData |
| BugReports: | https://github.com/Ginsburg1/ProSportsDraftData/issues |
| Imports: | dplyr, stats |
| Suggests: | knitr, rmarkdown, testthat (≥ 3.0.0) |
| Config/testthat/edition: | 3 |
| Depends: | R (≥ 3.5.0) |
| VignetteBuilder: | knitr |
| License: | GPL-3 |
| NeedsCompilation: | no |
| Packaged: | 2024-08-28 22:16:16 UTC; benjaminginsburg |
| Repository: | CRAN |
| Date/Publication: | 2024-09-03 14:30:08 UTC |
ProSportsDraftData: Professional Sports Draft Data
Description
The 'ProSportsDraftData' package provides comprehensive draft data for major professional sports leagues, including the National Football League (NFL), National Basketball Association (NBA), and National Hockey League (NHL). It offers access to both historical and current draft data, allowing for detailed analysis and research on player biases and player performance. The package is useful for sports fans and researchers interested in identifying biases and trends within scouting reports. Created by web scraping data from leading websites that cover professional sports player scouting reports, the package allows users to filter and summarize data for analytical purposes. For further details on the methods used, please refer to Wickham (2022) "rvest: Easily Harvest (Scrape) Web Pages" https://CRAN.R-project.org/package=rvest and Harrison (2023) "RSelenium: R Bindings for Selenium WebDriver" https://CRAN.R-project.org/package=RSelenium.
Author(s)
Maintainer: Benjamin Ginsburg benjamin.ginsburg@du.edu
See Also
Useful links:
Report bugs at https://github.com/Ginsburg1/ProSportsDraftData/issues
NBA Draft Data
Description
A dataset of all first-round picks in the NBA, including various draft metrics.
Usage
nba_data
Format
A tibble with the following columns:
sourceThe source of the data.
nameThe name of the player.
yearThe year of the draft.
rankThe rank of the player in the draft.
roundThe round in which the player was drafted.
heightThe height of the player.
weightThe weight of the player.
schoolThe school the player attended.
college_abbreviationThe abbreviation of the college the player attended.
prosThe pros of the player's abilities.
consThe cons of the player's abilities.
verdictThe overall verdict on the player's potential.
pre_draft_analysisAnalysis of the player before the draft.
post_draft_analysisAnalysis of the player after the draft.
rankingThe player's ranking.
player_scoreThe score given to the player.
positionThe position the player plays.
gradeThe grade given to the player.
pts_totalTotal points scored by the player.
pts_per40Points per 40 minutes played.
reb_totalTotal rebounds by the player.
reb_per40Rebounds per 40 minutes played.
ast_totalTotal assists by the player.
ast_per40Assists per 40 minutes played.
efgEffective field goal percentage.
fgaField goal attempts.
stl_totalTotal steals by the player.
stl_per40Steals per 40 minutes played.
blk_totalTotal blocks by the player.
blk_per40Blocks per 40 minutes played.
tptThree-point shots made.
tpaThree-point attempts.
ftFree throws made.
ftaFree throw attempts.
tsTrue shooting percentage.
ts_attTrue shooting attempts.
main_selling_pointThe main selling point of the player.
ageThe age of the player at the time of the draft.
wingspanThe wingspan of the player.
standing_reachThe standing reach of the player.
similar_playerA player with a similar profile.
profileProfile description of the player.
analysisAn analysis of the player's potential and performance.
projectionProjection of the player's future performance.
date_of_birthThe date of birth of the player.
hometownThe hometown of the player.
high_schoolThe high school the player attended.
reboundingEvaluation of the player's rebounding skills.
potentialEvaluation of the player's potential.
post_skillsEvaluation of the player's post skills.
intangiblesEvaluation of the player's intangibles.
international_teamThe international team the player has been part of.
athleticismEvaluation of the player's athleticism.
sizeEvaluation of the player's size.
defenseEvaluation of the player's defensive skills.
strengthEvaluation of the player's strength.
quicknessEvaluation of the player's quickness.
leadershipEvaluation of the player's leadership skills.
jumpshotEvaluation of the player's jumpshot.
nba_readyEvaluation of the player's NBA readiness.
mock_draft_rankThe player's ranking in mock drafts.
big_board_rankThe player's ranking on big boards.
overall_rankThe player's overall ranking.
Examples
# Load the dataset
data(nba_data)
# View the first few rows
head(nba_data)
# View players who attended Duke
library(dplyr)
duke_players <- nba_data |>
filter(college_abbreviation == "DUKE")
print(duke_players)
# Average score by position
avg_score_by_position <- nba_data |>
group_by(position) |>
summarise(avg_score = mean(player_score, na.rm = TRUE))
print(avg_score_by_position)
# Calculate the average points per 40 minutes
avg_pts_per40 <- nba_data |>
summarise(avg_pts = mean(pts_per40, na.rm = TRUE))
print(avg_pts_per40)
# Count the number of players from each college
college_counts <- nba_data |>
group_by(college_abbreviation) |>
summarise(count = n())
print(college_counts)
NBA Data Base
Description
Filter NBA Data by Source (Base)
Usage
nba_data_base()
Format
A tibble with the following columns:
nameThe name of the player.
yearThe year of the draft.
roundThe round in which the player was drafted.
Details
Filters and selects NBA data from the Base (ESPN.com) source.
Value
A filtered and selected tibble of NBA data from Base.
Examples
# Filter NBA data for base source
base_data <- nba_data_base()
# View the first few rows
head(base_data)
NBA Data ESPN
Description
Filter NBA Data by Source (ESPN)
Usage
nba_data_espn()
Format
A tibble with the following columns:
nameThe name of the player.
yearThe year of the draft.
heightThe height of the player.
weightThe weight of the player.
schoolThe school the player attended.
college_abbreviationThe abbreviation of the college the player attended.
prosThe pros of the player's abilities.
consThe cons of the player's abilities.
verdictThe overall verdict on the player's potential.
pre_draft_analysisAnalysis of the player before the draft.
post_draft_analysisAnalysis of the player after the draft.
rankingThe player's ranking.
player_scoreThe score given to the player.
Details
Filters and selects NBA data from ESPN for the given source value.
Value
A filtered and selected tibble of NBA data from ESPN.
Examples
# Filter NBA data for ESPN
espn_data <- nba_data_espn()
# View the first few rows
head(espn_data)
NBA Data NBA.com
Description
Filter NBA Data by Source (NBA.com)
Usage
nba_data_nba_com()
Format
A tibble with the following columns:
nameThe name of the player.
yearThe year of the draft.
positionThe position the player plays.
schoolThe school the player attended.
weightThe weight of the player.
gradeThe grade given to the player.
date_of_birthThe date of birth of the player.
profileProfile description of the player.
analysisAn analysis of the player's potential and performance.
projectionProjection of the player's future performance.
Details
Filters and selects NBA data from NBA.com for the given source value.
Value
A filtered and selected tibble of NBA data from NBA.com.
Examples
# Filter NFL data for NBA.com
nba_com <- nba_data_nba_com()
# View the first few rows
head(nba_com)
NBA Data NBADraft.net
Description
Filter NBA Data by Source (NBADraft.net)
Usage
nba_data_nbadraft_net()
Format
A tibble with the following columns:
nameThe name of the player.
heightThe height of the player.
weightThe weight of the player.
positionThe position the player plays.
schoolThe school the player attended.
date_of_birthThe date of birth of the player.
hometownThe hometown of the player.
high_schoolThe high school the player attended.
international_teamThe international team the player has been part of.
athleticismEvaluation of the player's athleticism.
sizeEvaluation of the player's size.
defenseEvaluation of the player's defensive skills.
strengthEvaluation of the player's strength.
quicknessEvaluation of the player's quickness.
leadershipEvaluation of the player's leadership skills.
jumpshotEvaluation of the player's jumpshot.
nba_readyEvaluation of the player's NBA readiness.
reboundingEvaluation of the player's rebounding skills.
potentialEvaluation of the player's potential.
post_skillsEvaluation of the player's post skills.
intangiblesEvaluation of the player's intangibles.
mock_draft_rankThe player's ranking in mock drafts.
big_board_rankThe player's ranking on big boards.
overall_rankThe player's overall ranking.
similar_playerA player with a similar profile.
prosThe pros of the player's abilities.
consThe cons of the player's abilities.
Details
Filters and selects NBA data from NBADraft.net for the given source value.
Value
A filtered and selected tibble of NBA data from NBADraft.net.
Examples
# Filter NFL data for NBADraft.net
nbadraft <- nba_data_nbadraft_net()
# View the first few rows
head(nbadraft)
NBA Data The Ringer
Description
Filter NBA Data by Source (The Ringer)
Usage
nba_data_the_ringer()
Format
A tibble with the following columns:
nameThe name of the player.
yearThe year of the draft.
positionThe position the player plays.
schoolThe school the player attended.
gradeThe grade given to the player.
pts_totalTotal points scored by the player.
pts_per40Points per 40 minutes played.
reb_totalTotal rebounds by the player.
reb_per40Rebounds per 40 minutes played.
ast_totalTotal assists by the player.
ast_per40Assists per 40 minutes played.
efgEffective field goal percentage.
fgaField goal attempts.
stl_totalTotal steals by the player.
stl_per40Steals per 40 minutes played.
blk_totalTotal blocks by the player.
blk_per40Blocks per 40 minutes played.
tptThree-point shots made.
tpaThree-point attempts.
ftFree throws made.
ftaFree throw attempts.
tsTrue shooting percentage.
ts_attTrue shooting attempts.
main_selling_pointThe main selling point of the player.
ageThe age of the player at the time of the draft.
heightThe height of the player.
weightThe weight of the player.
wingspanThe wingspan of the player.
standing_reachThe standing reach of the player.
analysisAn analysis of the player's potential and performance.
similar_playerA player with a similar profile.
prosThe pros of the player's abilities.
consThe cons of the player's abilities.
Details
Filters and selects NBA data from The Ringer for the given source value.
Value
A filtered and selected tibble of NBA data from The Ringer.
Examples
# Filter NFL data for The Ringer
ringer_data <- nba_data_the_ringer()
# View the first few rows
head(ringer_data)
NFL Draft Data
Description
A dataset of all first-round picks in the NFL, including various draft metrics.
Usage
nfl_data
Format
A tibble with the following columns:
sourceThe source of the data.
nameThe name of the player.
yearThe year of the draft.
rankThe rank of the player.
roundThe round in which the player was drafted.
heightThe height of the player.
weightThe weight of the player.
positionThe position of the player.
collegeThe college the player attended.
prosThe pros of the player's abilities.
consThe cons of the player's abilities.
similar_playerA similar player for comparison.
summaryA summary of the player's abilities.
arm_lengthThe arm length of the player.
hand_lengthThe hand length of the player.
next_gen_production_scoreThe Next Gen production score.
next_gen_athleticism_scoreThe Next Gen athleticism score.
forty_yard_dashThe forty-yard dash time.
vertical_jumpThe vertical jump height.
nfl_prospect_gradeThe NFL prospect grade.
home_townThe hometown of the player.
broad_jumpThe broad jump distance.
three_cone_drillThe three-cone drill time.
twenty_yard_shuttleThe twenty-yard shuttle time.
bench_pressThe bench press reps.
college_abbrivationThe abbreviation of the college.
pre_draftPre-draft information.
post_draftPost-draft information.
position_rankThe position rank of the player.
overall_rankThe overall rank of the player.
gradeThe grade of the player.
schoolThe school the player attended.
ydsYards the player ran.
ypaThe yards per attempt.
yprThe yards per reception.
tdsNumber of touchdowns the player performed.
intsThe interceptions.
rtgThe rating of the player.
tklsThe number of taclees of the player.
tflThe tackles for loss.
ypcThe yards per carry.
pbuThe pass break-ups of the player.
twenty_plusPlays of twenty or more yards.
sacksNumber of sacks of the player.
gmsThe number of games played.
strtsThe number of games started.
sk_allThe number of sack allowed.
ageThe age of the player.
main_selling_pointThe main selling point of the player.
descriptionThe description of the player.
scouting_reportThe scouting report.
scorePlayers score from 1-100.
Examples
# Load the dataset
data(nfl_data)
# View the first few rows
head(nfl_data)
# Filter data for NFL.com source
nfl_com_data <- nfl_data[nfl_data$source == "NFL.com", ]
# Filter data for The Ringer source
the_ringer_data <- nfl_data[nfl_data$source == "The Ringer", ]
NFL Data Base
Description
Filter NFL Data by Source (Base)
Usage
nfl_data_base()
Format
A tibble with the following columns:
nameThe name of the player.
roundThe round in which the player was drafted.
rankThe rank of the player.
Details
Filters and selects NFL data from the base source for the given source value.
Value
A filtered and selected tibble of NFL data.
Examples
# Filter NFL data for base source
base_data <- nfl_data_base()
# View the first few rows
head(base_data)
NFL Data ESPN
Description
Filter NFL Data by Source (ESPN)
Usage
nfl_data_espn()
Format
A tibble with the following columns:
sourceThe source of the data.
nameThe name of the player.
yearThe year of the draft.
heightThe height of the player.
weightThe weight of the player.
collegeThe college the player attended.
college_abbrivationThe abbreviation of the college.
pre_draftPre-draft information.
post_draftPost-draft information.
position_rankThe position rank of the player.
overall_rankThe overall rank of the player.
scoreThe player's score from 1-100.
Details
Filters and selects NFL data from ESPN for the given source value.
Value
A filtered and selected tibble of NFL data from ESPN.
Examples
# Filter NFL data for ESPN source
espn_data <- nfl_data_espn()
# View the first few rows
head(espn_data)
NFL Data NFL.com
Description
Filter NFL Data by Source (NFL.com)
Usage
nfl_data_nfl_com()
Format
A tibble with the following columns:
nameThe name of the player.
yearThe draft year of the player.
heightThe height of the player.
weightThe weight of the player.
positionThe position of the player.
collegeThe college the player attended.
prosThe pros of the player's abilities.
consThe cons of the player's abilities.
similar_playerA similar player for comparison.
summaryA summary of the player's abilities.
Details
Filters and selects NFL data from NFL.com for the given source value.
Value
A filtered and selected tibble of NFL data from NFL.com.
Examples
# Filter NFL data for NFL.com
nfl_data <- nfl_data_nfl_com()
# View the first few rows
head(nfl_data)
NFL Data The Ringer
Description
Filter NFL Data by Source (The Ringer)
Usage
nfl_data_the_ringer()
Format
A tibble with the following columns:
nameThe name of the player.
rankThe rank of the player.
yearThe draft year of the player.
positionThe position of the player.
collegeThe college the player attended.
gradeThe grade of the player.
ydsThe number of yards the player ran.
ypaThe yards per attempt.
yprThe yards per reception.
tdsThe number of touchdowns by the player.
intsThe number of interceptions.
rtgThe rating of the player.
tklsThe number of tackles by the player.
tflThe number of tackles for loss.
ypcThe yards per carry.
pbuThe number of pass break-ups by the player.
twenty_plusThe number of plays of twenty or more yards.
sacksThe number of sacks by the player.
gmsThe number of games played.
strtsThe number of games started.
sk_allThe number of sacks allowed by the player.
heightThe height of the player.
weightThe weight of the player.
ageThe age of the player.
main_selling_pointThe main selling point of the player.
descriptionA description of the player.
similar_playerA similar player for comparison.
scouting_reportThe scouting report of the player.
prosThe pros of the player's abilities.
consThe cons of the player's abilities.
Details
Filters and selects NFL data from The Ringer for the given source value.
Value
A filtered and selected tibble of NFL data from The Ringer.
Examples
# Filter NFL data for The Ringer
ringer_data <- nfl_data_the_ringer()
# View the first few rows
head(ringer_data)
NFL Data Walter Football
Description
Filter NFL Data by Source (Walter Football)
Usage
nfl_data_walter_football()
Format
A tibble with the following columns:
nameThe name of the player.
yearThe draft year of the player.
heightThe height of the player.
weightThe weight of the player.
arm_lengthThe arm length of the player.
hand_lengthThe hand length of the player.
collegeThe college the player attended.
positionThe position of the player.
next_gen_production_scoreThe Next Gen production score.
next_gen_athleticism_scoreThe Next Gen athleticism score.
forty_yard_dashThe forty-yard dash time.
vertical_jumpThe vertical jump height.
nfl_prospect_gradeThe NFL prospect grade.
home_townThe hometown of the player.
broad_jumpThe broad jump distance.
three_cone_drillThe three-cone drill time.
twenty_yard_shuttleThe twenty-yard shuttle time.
bench_pressThe bench press reps.
similar_playerA similar player for comparison.
summaryA summary of the player's abilities.
prosThe pros of the player's abilities.
consThe cons of the player's abilities.
Details
Filters and selects NFL data from Walter Football for the given source value.
Value
A filtered and selected tibble of NFL data from Walter Football.
Examples
# Filter NFL data for Walter Football source
walter_data <- nfl_data_walter_football()
# View the first few rows
head(walter_data)
NHL Draft Data
Description
A dataset of all first-round picks in the NHL, including various draft metrics.
Usage
nhl_data
Format
A tibble with the following columns:
sourceThe source of the data.
nameThe name of the player.
yearThe year of the draft.
rankThe rank of the player in the draft.
roundThe round in which the player was drafted.
date_of_birthThe date of birth of the player.
ageThe age of the player at the time of the draft.
current_teamThe current team of the player.
leagueThe league the player was playing in before the draft.
scouting_reportA scouting report of the player.
team_fitHow well the player fits with the drafting team.
likely_to_playThe likelihood of the player playing in the NHL.
positionThe position the player plays.
cityThe city associated with the player or team.
teamThe team that selected the player.
analysisAn analysis of the player's potential and performance.
Examples
# Load the dataset
data(nhl_data)
# View the first few rows
head(nhl_data)
# Filter the data for players drafted in a specific year, e.g., 2023
nhl_2023 <- dplyr::filter(nhl_data, year == 2023)
# View the first few rows
head(nhl_2023)
# Filter the data to include only ESPN as the source
nhl_data |>
dplyr::filter(source == "ESPN")
NHL Data Base
Description
Filter NHL Data by Source (Base)
Usage
nhl_data_base()
Format
A tibble with the following columns:
nameThe name of the player.
roundThe round in which the player was drafted.
rankThe rank of the player in the draft.
Details
Filters and selects NHL data from the base source for the given source value.
Value
A filtered and selected tibble of NHL data.
Examples
# Filter NHL data for base source
base_data <- nhl_data_base()
# View the first few rows
head(base_data)
NHL Data ESPN
Description
Filter NHL Data by Source (ESPN)
Usage
nhl_data_espn()
Format
A tibble with the following columns:
nameThe name of the player.
yearThe year of the draft.
date_of_birthThe date of birth of the player.
ageThe age of the player at the time of the draft.
current_teamThe current team of the player.
leagueThe league the player was playing in before the draft.
scouting_reportA scouting report of the player.
team_fitHow well the player fits with the drafting team.
likely_to_playThe likelihood of the player playing in the NHL.
Details
Filters and selects NHL data from ESPN for the given source value.
Value
A filtered and selected tibble of NHL data from ESPN.
Examples
# Filter NHL data for ESPN
espn_data <- nhl_data_espn()
# View the first few rows
head(espn_data)
NHL Data NHL.com
Description
Filter NHL Data by Source (NHL.com)
Usage
nhl_data_nhl.com()
Format
A tibble with the following columns:
nameThe name of the player.
yearThe year of the draft.
positionThe position the player plays.
cityThe city associated with the player or team.
teamThe team that selected the player.
leagueThe league the player was playing in before the draft.
scouting_reportA scouting report of the player.
analysisAn analysis of the player's potential and performance.
Details
Filters and selects NHL data from NHL.com for the given source value.
Value
A filtered and selected tibble of NHL data from NHL.com.
Examples
# Filter NHL data for nhl.com
nhl_com_data <- nhl_data_nhl.com()
# View the first few rows
head(nhl_com_data)