The data Artisans Blog
Apache Flink, stream processing, event-driven applications, and more.
Real-time stream processing: The next step for Apache Flink™May 6, 2015
This post also appears as a guest post at the Confluent blog. Stream processing is becoming very popular with open source projects like Apache Kafka, Apache Samza, Apache Storm, Apache Spark’s Stre...
Announcing Google Cloud Dataflow on Flink and easy Flink deployment on Google CloudApril 5, 2015
Today, we are pleased to announce a deeper engagement between Google, data Artisans, and the broader Apache Flink™ community to bring easy Flink deployment to Google Cloud Platform, and enable Googl...
How to factorize a 700 GB matrix with Apache Flink™March 30, 2015
This article is a follow-up post to the earlier published article about Computing recommendations at extreme scale with Apache Flink. We discuss how we implemented the alternating least squares (ALS) ...
Computing Recommendations at Extreme Scale with Apache Flink™March 18, 2015
Note: This article is a summary of the more detailed article How to factorize a 700 GB matrix with Apache Flink™.Recommender Systems and Matrix FactorizationRecommender Systems are a very successful...
Flink becomes a Top-Level Apache ProjectJanuary 12, 2015
The Apache Software Foundation announced Flink as a Top-Level Apache project. Read more in the ASF’s press release. ...
Data Analysis with Flink: A case study and tutorialNovember 29, 2014
This article is a step-by-step guide to implement a fairly sophisticated data analysis algorithm, end-to-end in Apache Flink. We will use the PageRankalgorithm, an algorithm used for ranking entities ...