Big Data Case Study (1)
Big Data Case Study (1)
1. Introduction
2. Problem Statement
● Challenges: The firm needs to process and analyze large volumes of structured and
unstructured data generated by millions of Facebook Marketplace users and listings.
● Impact: Difficulty in providing accurate, actionable insights for platform
improvements and seller strategies.
3. Data Collection
4. Data Processing
● Technologies Used:
○ Storage: Amazon S3 for scalable cloud storage.
○ Processing: Apache Spark for distributed data processing.
○ Ingestion: AWS Kinesis for real-time data ingestion.
● Methods: Data extraction using Facebook Marketplace API, data cleaning, and
feature extraction (e.g., listing popularity, buyer behavior metrics, seller performance
indicators).
5. Data Analysis
● Techniques:
○ Descriptive Analytics: To understand general marketplace trends and user
behavior patterns.
○ Predictive Analytics: To forecast demand for different product categories and
price points.
○ Sentiment Analysis: To analyze user feedback and comments.
● Tools: Python (Pandas, NumPy, NLTK), TensorFlow for machine learning, Tableau
for visualization.
6. Findings
● User Buying Patterns: Identified peak times for purchases and high-activity
categories.
● Listing Performance: Determined factors that contribute to successful listings (e.g.,
quality of images, detailed descriptions).
● Geographic Insights: Found significant differences in product preferences and
pricing across different regions.
● User Segments: Categorized users into segments such as "frequent buyers,"
"occasional sellers," and "power sellers."
7. Implementation
● Actionable Steps:
○ Listing Optimization: Advised sellers on best practices for creating attractive
listings.
○ Pricing Strategy: Developed a dynamic pricing model based on demand and
regional preferences.
○ User Experience: Recommended platform improvements to enhance search
and discovery features.
● Challenges: Balancing user privacy with data analysis needs, ensuring fair
marketplace practices.
8. Results
● Outcomes:
○ Increased Sales: Sellers implementing the recommendations saw a 30%
increase in successful transactions.
○ Improved User Satisfaction: Platform changes resulted in a 20% increase in
user satisfaction scores.
○ Reduced Time-to-Sell: Average time to sell items decreased by 25% for
optimized listings.
9. Conclusion
1. Heat Map: Displaying peak times and days for marketplace activity across different
product categories.
2. Scatter Plot: Showing the relationship between listing quality factors (e.g., number of
photos, description length) and time-to-sell.
3. Geographic Dashboard: Visualizing regional preferences for different product
categories and price points.