Big Data and Data Science: move to the next level - a free course from Stepik, 11 lessons, date: November 28, 2023.
Miscellaneous / November 29, 2023
If you already know a little about Data Science and want to continue learning, fill in gaps and get more practice, this course is for you. As part of the program, you will delve deeper into the field of Data Science - get acquainted with the MapReduce architecture and the Apache Hadoop ecosystem, understand the design of Apache Spark and Apache Parquet, and master the basics of neural networks and their architectures. You will also learn how to conduct business analytics with Power BI and what soft skills a project manager needs.
The purpose of the course is to introduce you to Big Data and Data Science, so all course materials are introductory and do not go into every detail. If you want a deeper dive, use the additional materials provided with the modules.
The course consists of five modules that will help you grow systematically in the field of Data Science and put the theory into practice as you go:
1. Dive into data science.
Review which processes data analysis involves. Learn the basics of how long-term data storage works. You will work with SQL using the Oracle DBMS as an example and create your first database. Get familiar with the MapReduce architecture and the Apache Hadoop ecosystem.
2. Tools for data processing, analysis and data visualization.
Learn what Power BI is and what problems it can solve. Learn to obtain, model, and analyze data and to build visualizations.
3. Tools for working with big data.
You will understand how Apache Spark, a tool for working with big data, is designed. Learn the Apache Parquet data storage format and its features. You will also look at examples of working with PySpark in a Jupyter notebook.
4. Machine learning systems.
You will learn what a p-value is, why statistical tests are needed, and what tasks they help with. Learn the concept of neural networks, their features and basic architectures. You will also understand how to develop a lean and pragmatic approach to using big data.
5. Soft Skills and Project Management.
Learn the principles and techniques of project management. Look at how the life cycle of a project managed under the Agile paradigm works. You will also learn which key soft skills an expert needs in order to grow as a leader and project manager.
Who is this course for?
The course is aimed at everyone who has basic knowledge of Data Science and wants to study the field further. It also suits specialists in IT and related fields who are interested in using machine learning in business, as well as graduates of the course "Big Data and Data Science: start the dive from scratch" who wish to continue their education.
Initial requirements
To get the most out of the course, you need confident computer skills and basic knowledge of programming and SQL. You can get them in the previous course of our project - “Big Data and Data Science: start your dive from scratch.”
Data Scientist, ML/DL researcher, teacher
Experience in analytics - 5 years. Worked as a Data Scientist at PJSC Megafon. Teaches courses at Skillbox, Netology, Yandex Practicum and other educational projects. Speaker at the Big Data Days 2021 conference.
Freelance Data Scientist, teacher, ML/DL researcher and course author. He worked as a researcher in the field of decision theory during his military service and afterwards collaborated with large and small companies. Former Data Scientist at PJSC Megafon.
We contribute to the development of schoolchildren and students from Russian regions and developing cities in neighboring countries, passing on to them the experience and expertise of metropolitan universities, companies and large international IT hubs.
The Russian School of Programming, abbreviated as RSP, teaches children and adults IT and soft skills through training camps, clubs, master classes, webinars, workshops, online courses and conferences. Our educational activities are based on intensity and deep immersion in the topic, the transfer of experience from senior specialists to beginners, mentoring and volunteering. We are committed to accessible practical education and to nurturing a new generation of personnel for the country’s digital economy. Our mission is to help young people become leaders of technological change.
Getting acquainted
1. Greetings
Dive
1. Introduction to Data Science
2. Dive into SQL
3. Big Data. Introduction to MapReduce. Introduction to the Hadoop Ecosystem
Data processing, analysis and data visualization tools
1. Analyze data in Power BI
Big Data Tools
1. Apache Spark framework
Machine learning systems
1. Statistics for beginners
2. Machine learning in business
3. Neural Network Basics
Soft Skills and Project Management
1. Data Project Management
Completion
1. Course Summary and Outcomes