Tuesday, January 15, 2019

Data manipulation with Python Part 2

In this post we would look at another Data Manipulation scenario. The nature of the task as well as resources have been summarized in the following sections:

Problem Statement: There is a sports accessories company ABC that sells sports gear across the globe. The data has fields such as Revenue, Quantity, Gross Margin, Order Method, Time, Country etc spread from 2012 to 2014 across 4 Quarters (Q1 through Q4). Based on the Global Outlook and growth forecasts made by economist at ABC, the company has decided to sell total units equal to 211,555,475 in 2015. However, this is an overall number and Product Managers for individual countries don't know how this number would drill down to individual Product. Your job as a Business Analyst is to help the Product Managers get the numbers so that they can plan effectively

Data Download Link

Learning Objectives:
  1. Using Pandas for data manipulation
    • Getting records and column names
    • Handle NA
    • Using Groupby to sum up numbers at different levels
  2. Using 'apply' function on certain columns
  3. Using lambda along with apply to get Percentage Share
  4. % formatting 
Please download the jupyter file from the following link

No comments:

Post a Comment

Word Cloud using R

Word Cloud Using R Word Cloud Using R 2024-09-16 Introduction I...