Search by tag python
Practical Apache Spark in 10 minutes. Part 7 - GraphX and Neo4j
A tutorial on integrating Apache Spark's GraphX with the Neo4j database management system for data analysis.
Practical Apache Spark in 10 minutes. Part 5 - Streaming
An article that explains how to work with data that arrives continuously as small records from different sources.
Practical Apache Spark in 10 minutes. Part 4 - MLlib
An article with step-by-step instructions on using MLlib classification algorithms with Apache Spark.
Practical Apache Spark in 10 minutes. Part 3 - DataFrames and SQL
A blog post that shows how to load a DataFrame and perform basic operations with both the API and SQL.
Practical Apache Spark in 10 minutes. Part 2 - RDD
A tutorial that explains how to create and work with an RDD, a distributed collection of items.
Practical Apache Spark in 10 minutes. Part 1 - Ubuntu installation
With this article, we begin a series of blog posts that walk through practical uses of Apache Spark for your tasks.