Specialization "Data Analyst" - course 2900 rub. from Stepik, training 36 lessons, Date: October 29, 2023.
Miscellaneous / / December 05, 2023
Data analyst, product analyst, customer base analyst, CRM analyst, data scientist - all these professions are related to data analysis.
π― TOP requirements for these professions:
- Knowledge of SQL and experience working with databases;
- Knowledge of basic Python syntax and knowledge of the Pandas library;
- Knowledge of statistics and ability to apply it in data analysis;
- Analytical thinking.
This is not a complete list. For example, web analysts need to know Google Analytics and Yandex. Metrics, and for data scientists - machine learning. But I gave the basic requirements that are found in 70-80% of vacancies.
The Data Analyst specialization currently consists of two courses:
1οΈβ£ SQL for everyone
During the course, you will master SQL to the level of writing complex SQL queries and practice using data from a trading company in one of the most popular database management systems.
The course is designed for beginners who want to master SQL, as well as those who know SQL at a basic level, but want to fill in the gaps and consolidate their knowledge in practice.
2οΈβ£ Python: Data Analysis with Pandas
The course is devoted to practical work with Pandas. You will receive the necessary theory and reinforce it with a large number of practical problems.
The course is suitable for those who are already familiar with the basic Python syntax:
- Knows basic data types (including lists and dictionaries) and operations on them;
- Has an understanding of what a function and a method are.
The course can be taken even by beginners who can independently figure out how to install Python and the Pandas library. But, if you are just starting out with Python, you will have to catch up on some basic things on your own as the course progresses. By the way, you can get basic knowledge of Python syntax in this course. Then learning Pandas will be much easier.
The concept of the courses is based on three principles:
Simplicity
The presentation of the material is accessible and consistent - this will allow you to form the necessary knowledge base step by step.
Practice
Much attention is paid to practice - so that you not only solve the course problems, but also be able to apply knowledge in the future, on real projects.
Support
Feel free to ask questions in the comments, it is important for me that all the material is learned. I respond to comments within a day.
The order in which the courses are taken is not important.
1. SQL for everyone
Operations on one table
1. Introduction to the Database
2. Filtering WHERE rows. Regular expressions LIKE. AND and OR
3. Sorting strings ORDER BY
4. Aggregating functions: COUNT, SUM, MIN, MAX, AVG. Aliases
5. Grouping of GROUP BY and HAVING lines. Generating reports
6. Practice based on the results of the 1st module
Operations on multiple tables
1. Concepts of primary key and foreign key. Types of relationships in the database
2. Generating queries from multiple tables. INNER JOIN and Aliases
3. LEFT JOIN and other types of JOINs
4. Joining multiple tables using UNION and UNION ALL
5. Subqueries
6. Practice based on the results of the 2nd module
Additional Important Topics
1. CASE expression
2. Popular functions for working with strings
Practical tasks to reinforce course material
1. Simple queries
2. Complex queries
3. Conclusion
2. Python: Data Analysis with Pandas
Analyzing dataframes individually
1. Reading data from files
2. Express introduction to data
3. Column output
4. Data types
5. Filtering rows
6. Regular Expressions
7. Sorting strings
8. Aggregation functions
9. Row grouping
Analysis of multiple related dataframes
1. Merge a. k. a join
2. Concatenation
Additional required module
1. Working with dates and times
2. Pivot tables
3. New ways to create dataframes
4. Categorization of nominal features
5. Replacing values ββin a dataframe
6. Visualization in pandas
7. Slicing
8. Conclusion