Boston Moneyball ADEC 7320 Predict Number of Wins for Team Model Report

Boston Moneyball ADEC 7320 Predict Number of Wins for Team Model Report ORDER NOW FOR CUSTOMIZED AND ORIGINAL ESSAY PAPERS ON Boston Moneyball ADEC 7320 Predict Number of Wins for Team Model Report In this homework assignment, you will explore, analyze and model a data set containing approximately 2200 records. Each record represents a professional baseball team from the years 1871 to 2006 inclusive. Each record has the performance of the team for the given year, with all of the statistics adjusted to match the performance of a 162 game season. Boston Moneyball ADEC 7320 Predict Number of Wins for Team Model Report Your objective is to build a multiple linear regression model on the training data to predict the number of wins for the team. You can only use the variables given to you (or variables that you derive from the variables provided). A write-up submitted in PDF format. Your write-up should have four sections. Each one is described in the Assignment Requirements document. You may assume you are addressing me as a fellow data scientist, so do not need to shy away from technical details. Assigned predictions (the number of wins for the team) for the evaluation data set. Include your R statistical programming code in an Appendix. adec_7320_homework__1__moneyball_.pdf ADEC 7320 – Econometrics Homework #1 Assignment Requirements Overview In this homework assignment, you will explore, analyze and model a data set containing approximately 2200 records. Each record represents a professional baseball team from the years 1871 to 2006 inclusive. Each record has the performance of the team for the given year, with all of the statistics adjusted to match the performance of a 162 game season. Your objective is to build a multiple linear regression model on the training data to predict the number of wins for the team. You can only use the variables given to you (or variables that you derive from the variables provided). Below is a short description of the variables of interest in the data set: VARIABLE NAME INDEX TARGET_WINS TEAM_BATTING_H TEAM_BATTING_2B TEAM_BATTING_3B TEAM_BATTING_HR TEAM_BATTING_BB TEAM_BATTING_HBP TEAM_BATTING_SO TEAM_BASERUN_SB TEAM_BASERUN_CS TEAM_FIELDING_E TEAM_FIELDING_DP TEAM_PITCHING_BB TEAM_PITCHING_H TEAM_PITCHING_HR TEAM_PITCHING_SO DEFINITION . Boston Moneyball ADEC 7320 Predict Number of Wins for Team Model Report Identification Variable (do not use) Number of wins Base Hits by batters (1B,2B,3B,HR) Doubles by batters (2B) Triples by batters (3B) Homeruns by batters (4B) Walks by batters Batters hit by pitch (get a free base) Strikeouts by batters Stolen bases Caught stealing Errors Double Plays Walks allowed Hits allowed Homeruns allowed Strikeouts by pitchers THEORETICAL EFFECT None Positive Impact on Wins Positive Impact on Wins Positive Impact on Wins Positive Impact on Wins Positive Impact on Wins Positive Impact on Wins Negative Impact on Wins Positive Impact on Wins Negative Impact on Wins Negative Impact on Wins Positive Impact on Wins Negative Impact on Wins Negative Impact on Wins Negative Impact on Wins Positive Impact on Wins Deliverables: • A write-up submitted in PDF format. Your write-up should have four sections. Each one is described below. You may assume you are addressing me as a fellow data scientist, so do not need to shy away from technical details. • Assigned predictions (the number of wins for the team) for the evaluation data set. • Include your R statistical programming code in an Appendix. Write Up: 1. DATA EXPLORATION (50 Points) Describe the size and the variables in the moneyball training data set. Consider that too much detail will cause a manager to lose interest while too little detail will make the manager consider that you aren’t doing your job. Some suggestions are given below. Please do NOT treat this as a check list of things to do to complete the assignment. You should have your own thoughts on what to tell the boss. These are just ideas. a. Mean / Standard Deviation / Median b. Bar Chart or Box Plot of the data c. Is the data correlated to the target variable (or to other variables?) d. Are any of the variables missing and need to be imputed “fixed”? 2. DATA PREPARATION (50 Points) Describe how you have transformed the data by changing the original variables or creating new variables. If you did transform the data or create new variables, discuss why you did this. Here are some possible transformations. Boston Moneyball ADEC 7320 Predict Number of Wins for Team Model Report a. Fix missing values (maybe with a Mean or Median value) b. Create flags to suggest if a variable was missing c. Transform data by putting it into buckets d. Mathematical transforms such as log or square root (or use Box-Cox) e. Combine variables (such as ratios or adding or multiplying) to create new variables 3. BUILD MODELS (50 Points) Using the training data set, build at least three different multiple linear regression models, using different variables (or the same variables with different transformations). Since we have not yet covered automated variable selection methods, you should select the variables manually (unless you previously learned Forward or Stepwise selection, etc.). Since you manually selected a variable for inclusion into the model or exclusion into the model, indicate why this was done. Discuss the coefficients in the models, do they make sense? For example, if a team hits a lot of Home Runs, it would be reasonably expected that such a team would win more games. Boston Moneyball ADEC 7320 Predict Number of Wins for Team Model Report However, if the coefficient is negative (suggesting that the team would lose more games), then that needs to be discussed. Are you keeping the model even though it is counter intuitive? Why? The boss needs to know. 4. SELECT MODELS (50 Points) Decide on the criteria for selecting the best multiple linear regression model. Will you select a model with slightly worse performance if it makes more sense or is more parsimonious? Discuss why you selected your model. For selecting the best multiple linear regression model, will you use a metric such as Adjusted R2, RMSE, etc.? Be sure to explain how you can make inferences from the model, discuss multi-collinearity issues (if any), and discuss other relevant model output. Using the training data set, evaluate the multiple linear regression model based on (a) mean squared error, (b) R2, (c) F-statistic, and (d) residual plots. Make predictions using the evaluation data set. … Get a 10 % discount on an order above $ 100 Use the following coupon code : NURSING10

RECOMMENDED: NURS 4010 CU Interdisciplinary Team Approach & Patient Issues Stakeholder Presentation

Don't use plagiarized sources. Get Your Custom Essay on
Boston Moneyball ADEC 7320 Predict Number of Wins for Team Model Report
Get a 15% discount on this Paper
Order Essay

homeworkhelp

Quality Guaranteed

With us, you are either satisfied 100% or you get your money back-No monkey business

Check Prices
Make an order in advance and get the best price
Pages (550 words)
$0.00
*Price with a welcome 15% discount applied.
Pro tip: If you want to save more money and pay the lowest price, you need to set a more extended deadline.
We know that being a student these days is hard. Because of this, our prices are some of the lowest on the market.

Instead, we offer perks, discounts, and free services to enhance your experience.
Sign up, place your order, and leave the rest to our professional paper writers in less than 2 minutes.
step 1
Upload assignment instructions
Fill out the order form and provide paper details. You can even attach screenshots or add additional instructions later. If something is not clear or missing, the writer will contact you for clarification.
s
Get personalized services with My Paper Support
One writer for all your papers
You can select one writer for all your papers. This option enhances the consistency in the quality of your assignments. Select your preferred writer from the list of writers who have handledf your previous assignments
Same paper from different writers
Are you ordering the same assignment for a friend? You can get the same paper from different writers. The goal is to produce 100% unique and original papers
Copy of sources used
Our homework writers will provide you with copies of sources used on your request. Just add the option when plaing your order
What our partners say about us
We appreciate every review and are always looking for ways to grow. See what other students think about our do my paper service.
Nursing
Always perfect! Thank you!!!
Customer 452453, April 15th, 2021
IT, Web
Great job on the paper!!
Customer 452885, January 30th, 2023
Nursing
Always perfectly done!
Customer 452955, October 28th, 2023
Financial Statement Analysis
Thanks for the help
Customer 452821, March 10th, 2023
Other
Great job
Customer 452813, July 27th, 2023
Web programming
outstanding!
Customer 452715, September 16th, 2022
Other
GOOD
Customer 452813, July 5th, 2022
ENVIRONMENT SCIENCE
great
Customer 452813, June 26th, 2022
Nursing
They so amazing work!!
Customer 452707, January 29th, 2023
Nursing
Excellent work!! Thanks again!
Customer 452707, October 14th, 2022
Marketing
Yes and thank you
Customer 452701, October 25th, 2022
Nursing
Amazing work! I passed the assignment!
Customer 452707, August 20th, 2022
Enjoy affordable prices and lifetime discounts
Use a coupon FIRST15 and enjoy expert help with any task at the most affordable price.
Order Now Order in Chat

We now help with PROCTORED EXAM. Chat with a support agent for more details