Business Analytics Project - Real-World Scenario
PROJECT
The assessment task is the culmination of your journey into the world of business analytics. It is designed to give you exposure to the real-world environment of business analytics and to stimulate your creativity as well as your ability to solve business problems. You will work with real-world data to develop and evaluate a data model and to develop a business case solution based on the analysis of data sets. (Write a 1,500-word report.)
PURPOSE
The purpose of this assessment task is to:
- Demonstrate your data wrangling and visualization skills
- Demonstrate your knowledge and skills in developing models with a focus on classification and regression algorithms
- Evaluate developed models and identify bottlenecks and improvements
- Develop a solution to the business problem and communicate your findings to stakeholders
LEARNING OUTCOMES
This assessment task addresses the following learning outcomes:
1. Critically analyse the role of business analytics in supporting decision making in a modern organisation, with a focus on working with different data formats and data wrangling techniques;
2. Investigate and assess different analytics solutions in open-source environments to develop effective visualisations;
3. Evaluate analytics models to uncover hidden patterns in business data and understand relationships between variables;
4. Deconstruct and exemplify data communication strategies through reproducible reporting and collaborative practices with version control; and,
5. Exemplify creative and innovative problem-solving of complex professional challenges through the application of data analytics in the business domain.
ASSESSMENT STRUCTURE
PLEASE NOTE:
For this project you are provided with a real-world scenario, a related dataset and some background information. You select the dataset, explore it and find useful “insights”. Those “insights” should address the business problem that the dataset relates to. This is where you can “unleash” your creative potential. You are not provided with specific instructions on data wrangling, visualisation or modelling; instead, you need to work with a problem and suggest a solution based on data. Make sure you show how your insights are helpful to the business, as the ultimate goal is to use data to provide a recommendation. There are several parts in your project, and you are required to describe each stage in the report that you develop for submission.
INSTRUCTIONS
To get started on your assessment task, please follow the instructions below.
- Review the case and identify the business problem to address. This problem should be worded as a QUESTION that addresses the identified business problem.
E.g. Question: How can we increase sales using the provided data?
- It is recommended that you split this problem into sub-steps or sub-problems that would allow you to gradually build the solution.
E.g. Question: How can we increase sales using the provided data?
Subquestion 1: Which customer characteristics predict the item they purchase?
Subquestion 2: How can the volume of sales be predicted using the item characteristics?
- You need to have at least two subquestions: one will be solved using the regression model and the other will be solved using the classification model.
You need to demonstrate how this deconstruction addresses the overarching problem solution (i.e. your QUESTION).
- Use data wrangling and data visualisation approaches to explore the dataset in relation to the identified QUESTION and SUBQUESTIONS. Each data manipulation and data visualisation needs to add value to resolve the identified problem and help with decision making.
This step forms part of your written report, which you prepare as a .rmd document and submit as part of your report.
You need to explain what new insights your data wrangling and data visualisation provide.
- Develop at least 2 models addressing the problem using regression and classification approaches. The development of the models should be based on your knowledge, additional research and the problem to address. It should NOT be based only on the availability of data.
Explain your reasoning in developing your models and justify them.
You must use at least one classification and one regression algorithm.
This step forms part of your written report, which you prepare as a .rmd document and submit as part of your report.
- Evaluate your developed models and discuss how they “answer” your QUESTION and SUBQUESTIONS and help to resolve the identified business problem.
This step forms part of your written report, which you prepare as a .rmd document and submit as part of your report.
- Develop a communication document (business report) with your findings for your stakeholder/s. You need to report your findings in an easy-to-understand and visually supported way. This IS your .rmd report, which should be prepared using business report style.
- For tips and hints on how to write a business report, watch this video.
- Your conclusion should tell a STORY rather than report on the steps completed. You need to present management with options to resolve the identified problem.
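The requirement to fit and evaluate one regression and one classification model can be sketched as follows. This is only an illustration under stated assumptions: it uses R's built-in `mtcars` and `iris` datasets as stand-ins (not the project data), a 70/30 train/test split, and `lm` and `randomForest` as one possible pair of algorithms. The predictors chosen here are arbitrary; in the project they should follow from your QUESTION and SUBQUESTIONS.

```r
library(randomForest)

set.seed(42)  # make the random train/test splits reproducible

# --- Regression: predict a numeric outcome -------------------------------
# mtcars is a built-in stand-in dataset, not the project data.
reg_train <- sample(nrow(mtcars), size = round(0.7 * nrow(mtcars)))
reg_fit   <- lm(mpg ~ wt + hp, data = mtcars[reg_train, ])
reg_pred  <- predict(reg_fit, newdata = mtcars[-reg_train, ])

# Root mean squared error on the held-out rows
rmse <- sqrt(mean((mtcars$mpg[-reg_train] - reg_pred)^2))

# --- Classification: predict a categorical outcome -----------------------
# iris is a built-in stand-in dataset, not the project data.
cls_train <- sample(nrow(iris), size = round(0.7 * nrow(iris)))
cls_fit   <- randomForest(Species ~ ., data = iris[cls_train, ], ntree = 200)
cls_pred  <- predict(cls_fit, newdata = iris[-cls_train, ])

# Share of held-out rows classified correctly, plus the confusion matrix
accuracy <- mean(cls_pred == iris$Species[-cls_train])
conf_mat <- table(predicted = cls_pred, actual = iris$Species[-cls_train])
```

Evaluating on held-out rows, rather than on the training data, gives a more honest picture of how each model would perform on new observations, which is what the evaluation step of the report needs to discuss.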
Presentation
- Use PPT or any other suitable software to create slides for your presentation
- Record this presentation as a video pitch (10 min) using Panopto, Zoom or any other suitable software
- Present your data analysis with a focus on telling the story of the data
- Make sure you address the problem of your report
- Communicate actionable insights for the wider business
- Pitch the above insights to the intended stakeholders
ASSESSMENT CRITERIA
The following levels of criteria will be used to grade this assessment task:
- Business Report
- Data Wrangling - Rationale and stakeholder question, data exploration 30%
- Data Visualization - Communication of Insights & Solutions 20%
- Data Analysis, modelling 30%
- Cohesive group work and equitable distribution of the workload 10%
- Writing conventions following Business Report format 10%
- Presentation
- Data Storytelling – 40%
- Effective use of visuals – 20%
- Clarity of structure and flow (introduction, conclusion) – 20%
- Delivery, engagement with audience and time management – 20%
Here is the data
parks <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2021/2021-06-22/parks.csv')
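Before plotting, the text-formatted columns usually need converting to numbers. The sketch below assumes that the file has columns named `spend_per_resident_data`, `pct_near_park_data` and `park_pct_city_data`, stored as text such as "$100" or "80%"; check this with `names(parks)` or `skimr::skim(parks)` first. The helper names `to_number` and `clean_parks`, and the new column names they create, are hypothetical.

```r
library(dplyr)

# Strip currency signs, percent signs and thousands separators,
# then convert what is left to numeric. NA stays NA.
to_number <- function(x) {
  as.numeric(gsub("[^0-9.]", "", as.character(x)))
}

# Add numeric versions of the text columns and drop rows without spending data.
# ASSUMPTION: the three *_data column names below exist in the input.
clean_parks <- function(parks) {
  parks %>%
    mutate(
      spend_per_resident = to_number(spend_per_resident_data),
      pct_near_park      = to_number(pct_near_park_data),
      park_pct_city      = to_number(park_pct_city_data)
    ) %>%
    filter(!is.na(spend_per_resident))
}
```

With the data loaded as above, `parks_clean <- clean_parks(parks)` would give a data frame whose numeric columns can be summarised and plotted directly.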
Requirements:
- I need an RMarkdown file with all the data wrangling and visualisation code that we are going to use in the report and the presentation. Each of the following questions must be answerable with a plot alone, because we are going to present them:
- Is there a big difference in government spending in the years leading up to 2020? Check whether Covid-19 had any impact on green space spending.
- Show a summary for cities with a total of less than 120. A balance of green space must be considered: does this meet the requirements of the residents? Are there cities that need improvement? What will we do next?
- Create a group of cities with the highest spending per resident in the USA before and after Covid-19.
- Create a group of cities with the lowest spending per resident in the USA before and after Covid-19.
- Which three states have the largest amenities_points? Do those cities also have larger park sizes? Which three states have the lowest amenities_points? Compare their dog park accessibility. Do those cities also have smaller park sizes? What is the difference, and what should we do next?
- Which are the “Green Cities”, i.e. those that obtained a percentage above 75% for Percent of residents? Which cities have more than 20% in parkland percentage?
- Create a PPT presentation covering all the questions above, with charts and illustrations, plus a script.
- I need all the graphs to be included in the PPT presentation with notes, in the same style as the video that I have sent. I need 6 to 7 slides, including the contents slide and the background slide. This assessment is part of my work with a group, and they will start after I finish my part. My part covers the introduction, descriptive statistics, and comparisons between years and between some variables.
- I need the RMarkdown file and the presentation with the script ASAP, within 24 hours at the latest.
- I need the report to be ready by Monday (1500 words).
- NOTE: all these questions must use only the following libraries and functions: ggplot2, skimr, randomForest, lm, filter(!is.na(...)), tidyverse, dplyr, widyr.
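The spending and “Green Cities” questions above could be approached along the following lines. This is a sketch, not a finished analysis: it assumes a data frame with one row per city and year and numeric columns named `year`, `city`, `spend_per_resident` and `pct_near_park`. Those names are hypothetical stand-ins for cleaned versions of the raw columns, and all function names are illustrative.

```r
library(dplyr)
library(ggplot2)

# Median spending per resident for each year.
spend_by_year <- function(df) {
  df %>%
    filter(!is.na(spend_per_resident)) %>%
    group_by(year) %>%
    summarise(
      median_spend = median(spend_per_resident),
      n_cities     = n(),
      .groups      = "drop"
    )
}

# Line chart of the yearly medians, to show any change around 2020.
plot_spend_by_year <- function(df) {
  ggplot(spend_by_year(df), aes(x = year, y = median_spend)) +
    geom_line() +
    geom_point() +
    labs(x = "Year", y = "Median spending per resident (USD)")
}

# The n highest-spending (or lowest-spending) cities in a given year.
rank_cities <- function(df, yr, n = 5, highest = TRUE) {
  ranked <- df %>% filter(year == yr, !is.na(spend_per_resident))
  ranked <- if (highest) {
    arrange(ranked, desc(spend_per_resident))
  } else {
    arrange(ranked, spend_per_resident)
  }
  head(ranked, n)
}

# Cities whose percentage of residents exceeds a threshold in a given year.
green_cities <- function(df, yr, min_pct = 75) {
  df %>%
    filter(year == yr, pct_near_park > min_pct) %>%
    arrange(desc(pct_near_park))
}
```

Calling `rank_cities()` once for a pre-Covid year and once for 2020, with `highest = TRUE` and then `highest = FALSE`, gives the four groups asked for. Which year counts as "before Covid-19" is a choice that should be stated in the report.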