During the first week, interns will focus on mastering essential data cleaning techniques using Excel. The dataset comprises Retail Sales data from Rwanda with over 1,500 rows. Key tasks include exploring the dataset to understand its structure, handling missing values to ensure data completeness, correcting any inconsistencies for data accuracy, and recalculating totals where necessary. Additionally, interns will create pivot tables and charts to visualize sales patterns and summaries. The deliverables for this week are a cleaned Excel file, pivot tables demonstrating key insights, and a concise report (1–2 pages) summarizing the cleaning process and initial findings.
In the second week, the focus shifts to using Python for data analysis. The dataset involves HR Employee data from Rwanda, containing over 1,200 rows. Interns will learn to load datasets into Jupyter Notebooks, clean the data by handling missing values and correcting data types, and engineer new variables such as age and tenure. Descriptive statistics will be computed to understand data distributions. Visualizations using Matplotlib and Seaborn will help explore relationships and patterns. The deliverables include a well-documented Jupyter Notebook, charts and visualizations, and a report (Report #2) highlighting exploratory data analysis insights.
The third week introduces SQL for data analysis. Working with both the Retail Sales data (over 1,500 rows) and the HR data (over 1,200 rows), interns will import the datasets into a SQL environment. Tasks involve writing SQL scripts to aggregate revenue per employee and department, filter data, join tables to combine relevant information, and analyze top customers and cities. The deliverables are SQL scripts, query result outputs, and a report (Report #3) summarizing key insights derived from the SQL analysis.
In week four, interns will learn to build interactive dashboards using Power BI. Using the cleaned Retail Sales dataset, they will import data into Power BI and develop dashboards showcasing sales trends, top-performing products, and payment methods. They will then enhance the dashboards with slicers and KPIs for better interactivity. The deliverables include a Power BI (.pbix) file, exported PDF dashboards for presentation, and a report (Report #4) providing insights and actionable recommendations based on the visualizations.
The final, optional week allows interns to undertake a comprehensive capstone project, choosing topics such as sales prediction, employee performance analysis, or dashboard development. They will document all analysis steps thoroughly and publish their code, datasets, and reports to a GitHub repository. The deliverables include the GitHub repository, a final presentation summarizing the project, and a comprehensive report (Report #5) covering the entire analysis process, methodologies, key findings, and recommendations.
In this lesson, you'll learn how to open and navigate your dataset in Excel, gaining confidence in reviewing columns, data types, and sample rows. By the end, you'll be able to analyze key aspects of your data, setting a strong foundation for further analysis and insights.
In this lesson, students will explore proven techniques for managing missing data and resolving inconsistencies within datasets. They will learn how to identify, handle, and correct data issues to ensure accurate and reliable analysis, empowering them to maintain high-quality data in their projects.
In this lesson, students will learn how to recalculate totals in Excel so that summaries rest on accurate figures. They will also create and customize pivot tables to summarize, analyze, and visualize large datasets. By the end of the lesson, students will be able to check recalculated totals and build pivot-table summaries with confidence.
In this lesson, students will learn how to create compelling charts and visual reports that effectively communicate data insights. They will explore various types of visualizations, understand best practices for data presentation, and develop skills to design clear, informative, and impactful reports for any audience.
In this lesson, students will learn essential techniques for cleaning and preparing data in Python, including identifying and handling missing data. Through practical exercises, they will gain the skills to improve data quality and produce clean, reliable datasets for accurate analysis.
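As a minimal sketch of identifying and handling missing data, the example below counts gaps per column and then fills them. The column names (`employee_id`, `department`, `salary`) and the sample rows are hypothetical, since the HR dataset's actual columns are not listed here, and the fill strategy (median for numbers, a placeholder for text) is only one reasonable choice. It uses the standard library to stay self-contained; in a notebook, a data-frame library would be the more usual tool.

```python
from statistics import median

# Hypothetical sample rows; None or a blank string marks a missing value.
rows = [
    {"employee_id": 1, "department": "Sales", "salary": 450000},
    {"employee_id": 2, "department": "", "salary": None},
    {"employee_id": 3, "department": "HR", "salary": 520000},
    {"employee_id": 4, "department": "Sales", "salary": 610000},
]


def is_missing(value):
    """Treat None and blank strings as missing."""
    return value is None or (isinstance(value, str) and value.strip() == "")


def count_missing(rows):
    """Count missing values per column."""
    counts = {}
    for row in rows:
        for column, value in row.items():
            counts[column] = counts.get(column, 0) + (1 if is_missing(value) else 0)
    return counts


def fill_missing(rows):
    """Fill salary gaps with the median salary and department gaps with 'Unknown'."""
    known_salaries = [r["salary"] for r in rows if not is_missing(r["salary"])]
    salary_median = median(known_salaries)
    cleaned = []
    for row in rows:
        new_row = dict(row)  # copy so the original rows stay untouched
        if is_missing(new_row["salary"]):
            new_row["salary"] = salary_median
        if is_missing(new_row["department"]):
            new_row["department"] = "Unknown"
        cleaned.append(new_row)
    return cleaned


missing_before = count_missing(rows)
cleaned_rows = fill_missing(rows)
missing_after = count_missing(cleaned_rows)
```

Comparing `missing_before` with `missing_after` is a quick way to confirm that the cleaning step did what was intended before moving on to analysis.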
In this lesson, students will learn how to calculate age and tenure features and add them to their datasets to support modeling and richer insights. Through practical techniques and best practices, they will learn to transform raw data into meaningful features. By the end, students will be able to engineer age and tenure features confidently in their own data projects.
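One way to derive age and tenure is to count whole years between two dates. The sketch below assumes the dataset has a birth date and a hire date; the field names `birth_date` and `hire_date`, the sample values, and the fixed reference date are all hypothetical.

```python
from datetime import date


def years_between(start, end):
    """Whole years elapsed from start to end."""
    years = end.year - start.year
    # Subtract one if the anniversary has not yet occurred in the end year.
    if (end.month, end.day) < (start.month, start.day):
        years -= 1
    return years


# Hypothetical record; a fixed reference date keeps the result reproducible.
employee = {"birth_date": date(1990, 8, 15), "hire_date": date(2018, 3, 1)}
as_of = date(2024, 6, 30)

employee["age"] = years_between(employee["birth_date"], as_of)
employee["tenure_years"] = years_between(employee["hire_date"], as_of)
```

Using a fixed `as_of` date rather than today's date means the engineered features do not change each time the notebook is rerun.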
In this lesson, students will learn how to summarize and interpret data using descriptive statistics. They will explore key techniques such as calculating measures of central tendency and variability, and learn how to effectively visualize data for clearer insights. By the end, students will be equipped to describe datasets confidently and accurately.
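The measures of central tendency and variability mentioned above can be sketched with Python's built-in `statistics` module. The salary figures are hypothetical and chosen only to show how an outlier affects the summary.

```python
import statistics

# Hypothetical monthly salaries, in thousands.
salaries = [450, 520, 610, 480, 940]

summary = {
    "mean": statistics.mean(salaries),      # central tendency
    "median": statistics.median(salaries),  # central tendency, robust to outliers
    "stdev": statistics.stdev(salaries),    # sample standard deviation
    "min": min(salaries),
    "max": max(salaries),
    "range": max(salaries) - min(salaries),
}
```

Here the mean sits above the median because the single high value pulls it upward, which is a useful first hint that a distribution is skewed.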
In this lesson, students will learn how to create compelling data visualizations using Matplotlib and Seaborn, two powerful Python libraries. They will gain hands-on experience designing and customizing various charts and plots to effectively communicate insights from data. By the end of the lesson, learners will be able to produce professional-quality visualizations to enhance their data analysis skills.
In this lesson, students will learn the fundamentals of SQL and how to use it for data analysis. They will gain hands-on skills in writing queries to extract, filter, and organize data from databases, setting a solid foundation for effective data analysis. By the end, learners will be equipped to perform basic data retrieval tasks using SQL.
In this lesson, students will learn how to import data into a SQL database and write fundamental SQL queries to retrieve and manipulate information. They will gain practical skills in importing datasets and crafting basic commands like SELECT, WHERE, and ORDER BY to explore and analyze data effectively. By the end, learners will be equipped to handle essential data operations in SQL with confidence.
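As an illustrative sketch of `SELECT`, `WHERE`, and `ORDER BY`, the example below runs a query against an in-memory SQLite database through Python's `sqlite3` module. The `sales` table, its columns, and the rows are hypothetical; the course's actual SQL environment and schema may differ, and a real import would load rows from the dataset file rather than inline values.

```python
import sqlite3

# In-memory database standing in for the course's SQL environment (assumption).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sales ("
    "sale_id INTEGER PRIMARY KEY, customer TEXT, city TEXT, total REAL)"
)

# Hypothetical rows inserted inline in place of an imported dataset.
conn.executemany(
    "INSERT INTO sales (customer, city, total) VALUES (?, ?, ?)",
    [
        ("Customer A", "Kigali", 120000),
        ("Customer B", "Huye", 45000),
        ("Customer C", "Kigali", 80000),
        ("Customer D", "Musanze", 30000),
    ],
)

# SELECT picks columns, WHERE filters rows, ORDER BY sorts the result.
kigali_sales = conn.execute(
    """
    SELECT customer, total
    FROM sales
    WHERE city = ?
    ORDER BY total DESC
    """,
    ("Kigali",),
).fetchall()
conn.close()
```

Passing the city as a `?` parameter instead of building the query string by hand keeps the query safe when the filter value comes from user input.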
In this lesson, students will learn three core SQL analysis techniques: aggregating revenue data to identify key insights, filtering datasets to focus on relevant information, and joining multiple tables to create comprehensive views. By mastering these skills, students will be able to transform raw data into actionable insights for decision-making.
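Aggregating, filtering, and joining can be combined in a single query. The sketch below assumes two hypothetical tables, `employees` and `sales`, linked by `employee_id`; the schema, the sample values, and the filter threshold are illustrative and not taken from the course datasets.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical schema and rows for illustration only.
conn.executescript(
    """
    CREATE TABLE employees (employee_id INTEGER PRIMARY KEY, department TEXT);
    CREATE TABLE sales (sale_id INTEGER PRIMARY KEY, employee_id INTEGER, total REAL);
    INSERT INTO employees VALUES (1, 'Retail'), (2, 'Retail'), (3, 'Wholesale');
    INSERT INTO sales VALUES
        (1, 1, 100.0), (2, 1, 50.0), (3, 2, 200.0), (4, 3, 400.0), (5, 3, 5.0);
    """
)

# JOIN links each sale to its employee, WHERE drops small sales,
# GROUP BY with SUM aggregates revenue per department.
revenue_by_department = conn.execute(
    """
    SELECT e.department, SUM(s.total) AS revenue
    FROM sales AS s
    JOIN employees AS e ON e.employee_id = s.employee_id
    WHERE s.total >= 10
    GROUP BY e.department
    ORDER BY revenue DESC
    """
).fetchall()
conn.close()
```

Note that `WHERE` filters individual rows before aggregation, so the 5.0 sale is excluded from the Wholesale total.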
In this lesson, students will learn how to identify and analyze top customers and cities using SQL queries. Through practical exercises, they will gain skills in ranking, filtering, and aggregating data to uncover key insights about customer behavior and regional trends. By the end, students will be able to use SQL to support data-driven decisions for targeted marketing and strategic planning.
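A simple way to rank top customers and cities is to aggregate revenue, sort it in descending order, and keep the first few rows. The sketch below uses a hypothetical `sales` table with made-up rows; `ORDER BY ... LIMIT` is one straightforward ranking approach among several.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (customer TEXT, city TEXT, total REAL)")

# Hypothetical rows for illustration only.
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [
        ("Customer A", "Kigali", 300),
        ("Customer B", "Huye", 150),
        ("Customer A", "Kigali", 100),
        ("Customer C", "Kigali", 250),
        ("Customer B", "Huye", 50),
    ],
)

# Top two customers by total revenue.
top_customers = conn.execute(
    """
    SELECT customer, SUM(total) AS revenue
    FROM sales
    GROUP BY customer
    ORDER BY revenue DESC
    LIMIT 2
    """
).fetchall()

# The single city with the highest total revenue.
top_city = conn.execute(
    """
    SELECT city, SUM(total) AS revenue
    FROM sales
    GROUP BY city
    ORDER BY revenue DESC
    LIMIT 1
    """
).fetchone()
conn.close()
```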
Discover the fundamentals of Power BI in this beginner-friendly lesson, where you'll learn how to connect data sources, create insightful visualizations, and build interactive dashboards. By the end, you'll have the skills to turn raw data into meaningful reports that empower data-driven decision making. Perfect for newcomers eager to harness the power of data analytics!
In this lesson, students will learn how to import data from various sources and create basic dashboards to visualize information effectively. They will gain practical skills in organizing and presenting data to support decision-making. By the end, learners will be equipped to build their own simple, interactive dashboards for different data projects.
In this lesson, students will learn how to effectively enhance their dashboards by adding interactive slicers and visually impactful Key Performance Indicators (KPIs). They will understand how to filter data dynamically and display vital metrics clearly, enabling more insightful and user-friendly data analysis. By the end, learners will be equipped to create more engaging and informative dashboards for better decision-making.
In this lesson, students will learn how to design effective and visually appealing dashboards that communicate data insights clearly. They will also explore best practices for exporting reports efficiently, ensuring professional presentation and accurate data sharing. By the end of the lesson, participants will be equipped with essential skills to create compelling dashboards and export reports confidently.
In this lesson, students will learn how to choose a capstone project aligned with their interests and goals, and how to develop a plan for completing it. They will gain practical skills in project selection, goal setting, and step-by-step planning. By the end, students will be equipped to make confident decisions and create a clear roadmap for their capstone project.
Effective documentation is a vital component of the data analysis process. It ensures your workflow is transparent, reproducible, and accessible to colleagues and to your future self. Clear records of your steps help you identify errors, verify results, and collaborate with others. Comprehensive documentation also enhances the credibility and integrity of your findings.
In this lesson, students will learn how to publish their projects on GitHub, including creating repositories, managing commits, and pushing updates. By the end, students will be able to share their code online and collaborate with others. This step-by-step guide shows how to showcase completed work and make use of GitHub's version control features.
In this lesson, students will discover effective strategies to prepare and deliver compelling final presentations. They will learn essential tips for organizing content, engaging their audience, and managing presentation nerves to confidently showcase their work. By the end, students will be equipped with practical techniques to excel in their final delivery.