Spark-Py-Notebooks
spark-py-notebooks offers Apache Spark and Python (pySpark) tutorials as IPython/Jupyter notebooks. It covers basic to advanced concepts for big data analysis and machine learning.
About
spark-py-notebooks is a collection of IPython/Jupyter notebooks that teach Apache Spark concepts using Python (pySpark). The tutorials range from fundamental to advanced topics, with a focus on big data analysis and machine learning. They cover RDD creation; basic RDD operations such as map, filter, and collect; sampling; set operations; and data aggregations. The collection also covers key/value pair RDDs and introduces MLlib for basic statistics, exploratory data analysis, logistic regression, and decision trees. It additionally covers Spark SQL for structured processing with DataFrames and includes applications such as building a movie recommendation web service.
Pricing & Plans
Open Source
Free