Categories
Data analysis and reports

Explain the data cleaning process step-by-step.

USE RStuido
Use the dataset “Crimes_-_2001_to_Present” AND Requirement to answer the TWO questions in Rmarkdown (IMPORTANT: including code in Rmd script):
Requirements:
1. Clean the dataset properly (remember the tidy data principles when wrangling the data) Provide a snapshot of the dataset in your output (e.g., create a proper table showing the data). Need to show how they are structured, variables, etc
2. Explain the data cleaning process step-by-step.
3. Explicitly identify the unit of analysis when describing the data.
4. Change column names if they are not intuitive
5. Check out the missing data and explain how to address them (if applicable)
6. Do not include steps that are not replicable or done in Excel
Coding Requirement:
1. Use pipe annotations (not the dollar sign framework) as much as possible.
2. Create functions if / when needed.
3. Use loops to Optimize the work.
Data visualization Requirements:
1. Create your own theme for plots.
2. Comment on the results below each graph.
3. Include both explorative and explicative plots as well as tables:
1. Have explicative plots that showcase your main findings along with a few other explorative plots to showcase the distributions and frequencies of your variables. 2. Descriptive statistics in either table or plot form. Talk about the main variable distribution or frequency.
4. Check all elements in plots and tables and make sure they are intuitive for someone who doesn’t know your project – x and y axes labels, titles, notes and captions.
5. use aesthetically pleasant color in table and plots. Use color palettes.
Packages:
tidyverse
tidytuesdayR
paletteer
packges may use:
purrr
tinytex
extrafont
forcats
RColorBrewer
rcartocolor
ggrepel
(or more packages)
Question 1: What has changed in Chicago’s crime rates from 2019 to 2022?
Requirement: Summary the stats in a table. Use the dataset to create a line graph that shows the criminal case number by year and month from 2019 to 2021.
Question 2: How does Chicago’s crime rate relate to location, which areas have higher crime rates?
Requirement: Create a table that shows the crime cases in the top 10 high-crime-rate districts in Chicago. Then create a heatmap to show the Crime Type vs. Month Crossover Heat Map from 2019 to 2020. Expect to explore the relationship between different crime types and months.
Here attached the link of the data sources ( Or search Crimes – 2001 to Present – Chicago Data Portal):
https://data.cityofchicago.org/Public-Safety/Crimes-2001-to-Present/ijzp-q8t2/data

Categories
Data analysis and reports

How does spending score change with age/annual income?

Use mall_customer data file to perform clustering.
Column 1: Customer ID
Column 2: Gender
Column 3: Age
Column 4: Annual Income
Column 5: Spending score (the magic number that you get from customer profile)
Use your analysis to answer the following questions. Your answer should be supported by your analysis.
1. Everything being equal, do males or females have higher spending score?
2. How does spending score change with age/annual income?
3. What are the common features for those with high spending scores?

Categories
Data analysis and reports

The submission will consist of an Excel workbook (or an R script file if R has b

The submission will consist of an Excel workbook (or an R script file if R has been used) and a Word document– a minimum of two submissions that have been submitted as attachments.

Categories
Data analysis and reports

All working must be shown for all questions. For questions which ask you to writ

All working must be shown for all questions. For questions which ask you to write a program, you must provide the code you used in-text please. In text code would be fine. And please tell me should I motified or change anything from the file that I got, I am not fimiliar with coding.
The loss function is stated in chapter 2.8.
All the academic skills that you need is in the book.
Thanks

Categories
Data analysis and reports

Please use Eview to answer the questions. IMPORTANT: For all the unit-root tests

Please use Eview to answer the questions.
IMPORTANT:
For all the unit-root tests above, carefully state the (i) null and the alternative
hypothesis, (ii) test statistics and the corresponding critical value, (iii) Decision rule and (iv) Conclusion.
I uploaded the examples please strictly follow the hypothesis examples.
For the Eview tables you can just do a simply screen shot or something.

Categories
Data analysis and reports

Exam 2 Instruction answers are what I need help with, there are a number of ques

Exam 2 Instruction answers are what I need help with, there are a number of questions within. If I could get screenshots to help guide my way through their answering that would be awesome. I want to be able to run the reports myself. I have uploaded the two .csv, “training” and “prediction”. Finally, I included HW8, because it is mentioned within the assignment, if you need it.

Categories
Data analysis and reports

Submissions should consist of an (i) Excel workbook, (ii) an R script file, (iii

Submissions should consist of an (i) Excel workbook, (ii) an R script file, (iii) and a Word document – a total of 3 files. Please attach all files when submitting the project. Part I of this project should be completed in both Excel and R. Part II should be completed only
in R. In the Word document, write a formal report to the management, summarizing the results
that you have obtained