ICT702 Data Analysis Project For the Dataset Cleansing
Questions:
• Write programs that produce correct and useful output
• Apply relevant Python programming concepts to a data analysis challenge
• Read data from real sources and wrangle it into the form you need.
• Develop creative approaches to solving the wrangling/analysis problems.
• Adhere to the recommended Python programming styles
• Organise and present a data analysis report
• Give an insightful analysis of the given problem.
Answer:
Introduction
Data wrangling and analysis plays a very vital role in improving business performance in today’s completive business environment. Through the proper analysis of the data organizations can predict trends of sale as well as analysis of customer purchasing behaviours. Through the proper analysis and interpretation results can deliver important insights about the market scenario that can be used in meaningful ways. Using this insights, the organi
zation can improve business productivity with the effective decision-making.
Dataset cleansing and preparing
For this data analysis project, there are three datasets are provided which includes the following, Region.csv, StoreDetails.csv and WeeklySales.csv. First the WeeklySales dataset contains the information about the Store ID, Date and Weekly_Sales amount. StoreDetails.csv includes Store, Date, Temperature, Unemployment rate in the regions and finally the Region.csv file includes the store numbers in different regions.
In order to get the details of the sales form the different regions, sales and different other factors we merged the dataset depending the common column store in the different datasets.
Moreover, the merged dataset is also checked by the pandas package in order to remove the null values as well as inconsistency in the newly created dataset.
Insights from the dataset through the plots
In order to find the rate of unemployment and its relation with the sales of ice-cream in different regions we merged the datasets using the pandas package of python. The overall statistics of the merged dataset is provided in the following table,
|
Store |
Weekly_Sales |
Temperature |
Unemployment |
count |
6435.000000 |
6435.000000 |
6435.000000 |
6435.000000 |
mean |
23.000000 |
19213.485088 |
60.663782 |
7.999151 |
std |
12.988182 |
15102.373853 |
18.444933 |
1.875885 |
min |
1.000000 |
711.110000 |
-2.060000 |
3.879000 |
25% |
12.000000 |
10423.465000 |
47.460000 |
6.891000 |
50% |
23.000000 |
15314.910000 |
62.670000 |
7.874000 |
75% |
34.000000 |
23135.595000 |
74.940000 |
8.622000 |
max |
45.000000 |
172225.550000 |
100.140000 |
14.313000 |
For the merged dataset, there is total 6435 rows of data in it. For the weakly sales of the maximum mean and minimum value are provided by 172225.55, 19213.485 and 711.110. in case of temperature the maximum and minimum temperature in all the regions recorded are 100.14 and -2.060. at last the maximum, minimum and mean rate of unemployment (among all the regions) according to the dataset is provided by, 14.31,3.87 and 7.999.
From the weekly sales data sets, we found the following trend about the sales according to the different regions present in the data set
From the above plot it can be derived that the Region B has the maximum amount of the sales. The C region is at second position from the sales point of view. Following is plot that depicts the rate of un-employability in the different regions
Following chart shows the un-employability rates in different regions
By comparing the plots for the sales of the ice-cream in the different regions and the un-employability in different regions it can be said that for the region B, the sales of ice cream high with the unemployment rate. On the other hand, in case of region E, the sales of ice cream are lowest compared to the highest rate of unemployment.
Now to measure the performance of different stores in the different regions we plotted the sales of the ice cream grouping by the store numbers in the data sets.
From the above image, it can be stated that, complete merged dataset consists of total 45 stores in different regions. Accumulated maximum sales is done by the store number 13 whereas the lowest sales are by the store number 36.
Distribution of un-employability in the dataset is given by the following histogram plot;
From the above histogram plot, it can be said that the histogram is skewed right as there are only few values at the increasing value of the un employability. Thus it can be said that, the most general value for the un-employability in the regions are between 6 to 10.
Conclusion
Sales data analytics is considered as an important internal business function which is more than just presenting figures and sales numbers to the management in the organization. With much more in-depth methodologies in recording and crunching data the data about sales and other factors (having impact on the sales) can be presented in an easily-digestible format for a non-technical person too. Through the above analysis, the organization should try to improve its business in the regions (described in the sales analysis) by using proper strategy.
Buy ICT702 Data Analysis Project For the Dataset Cleansing Answers Online
Talk to our expert to get the help with ICT702 Data Analysis Project For the Dataset Cleansing Answers to complete your assessment on time and boost your grades now
The main aim/motive of the management assignment help services is to get connect with a greater number of students, and effectively help, and support them in getting completing their assignments the students also get find this a wonderful opportunity where they could effectively learn more about their topics, as the experts also have the best team members with them in which all the members effectively support each other to get complete their diploma assignments. They complete the assessments of the students in an appropriate manner and deliver them back to the students before the due date of the assignment so that the students could timely submit this, and can score higher marks. The experts of the assignment help services at urgenthomework.com are so much skilled, capable, talented, and experienced in their field of programming homework help writing assignments, so, for this, they can effectively write the best economics assignment help services.