Beam.png

February-March 2019 | Newsletter

What’s been done


Apache Beam 2.10.0 released (by: many contributors)

Apache Beam awarded the 2019 Technology of the Year Award!

  • InfoWorld just awarded Beam the 2019 Technology of the Year Award.
  • See this  article for more details.

Kettle Beam 0.5 released with support for flink (by: Matt Casters)

  • Kettle now supports Apache Flink as well as Cloud Dataflow and Spark.
  • See Matt’s Blog for more details.

What we’re working on...


Apache Beam 2.11.0 release (by: many contributors)

Hive Metastore Table provider for SQL (by: Anton Kedin)

  • Support for plugging table providers through Beam SQL API to allow obtaining table schemas from external sources.
  • See the PR for more details.

User Defined Coders for the Beam Go SDK (by: Robert Burke)

  • Working on expanding the variety of user defined types that can be a member of a PCollection in the Go SDK.
  • See BEAM-3306 for more details.

Python 3 (by: Ahmet Altay, Robert Bradshaw, Charles Chen, Mark Liu, Robbe Sneyders, Juta Staes, Valentyn Tymofieiev)

  • Beam 2.11.0 is the first release offering partial Python 3 support.
  • Many thanks to all contributors who helped to reach this milestone.
  • IO availablility on Python 3 is currently limited and only Python 3.5 version has been tested extensively.
  • Stay tuned on BEAM-1251 for more details.


Notebooks for quickstarts and custom I/O (by: David Cavazos)

  • Adding IPython notebooks and snippets
  • See [BEAM-6557] for more details.



      New members


New PMC member!

  • Etienne Chauchot, Nantes, France

New Committers!

  • Gleb Kanterov, Stockholm, Sweden
  • Michael Luckey

New Contributors!

  • Kyle Weaver, San Francisco, CA
  • Would like to help begin implementing portability support for the Spark runner
  • Tanay Tummapalli, Delhi, India
  • Would like to contribute to Open Source this summer as part of Google Summer of Code
  • Brian Hulette, Seattle, WA
  • Contributing to Beam Portability
  • Michał Walenia, Warsaw, Poland
  • Working on integration and load testing
  • Daniel Chen, San Francisco, CA
  • Working on Beam Samza runner


      Talks & meetups


Plugin Machine Intelligence and Apache Beam with Pentaho - Feb 7 @ London

  • Watch the How to Run Kettle on Apache Beam video here.
  • See event details here..

Beam @Lyft / Streaming, TensorFlow and use-cases - Feb 7 @ San Francisco, CA

.

Flink meetup - Feb 21@ Seattle, WA

  • Speakers from Alibaba, Google, and Uber gave talks about Apache Flink with Hive, Tensorflow, Beam, and AthenaX.
  • See event details here and presentations here.

Beam Summit Europe 2019 - June 19-20 @ Berlin

  • Beam Summit Europe 2019 will take place in Berlin on June 19-20.https://gdoc.pub/doc/e/2PACX-1vTQIS4WkxV-HpgX5Lb6q05g4-wuIVcYd82123Mp4Y6q9fMv6Ynwd-l7dI4TrMyCrKilyU-YsoitbnZB
  • Speaker CfP and other details to follow soon!
  • Twitter announcement!


   
 Resources


Apache Jira Beginner’s Guide (by:  Daniel Oliveira)

  • A guide to introduce Beam contributors to the basics of using the Apache Jira for Beam development. Feedback welcomed!

An approach to community building from Apache Beam (by: Kenn Knowles)

  • The Apache Software Foundation has published committer guidelines to help Beam's community building work.
  • See the post on the ASF blog.

Exploring Beam SQL on Google Cloud Platform (by: Graham Polley)

  • “In this article, I’ll dive into this new feature of Beam, and see how it works by using a pipeline to read a data file from GCS, transform it, and then perform a basic calculation on the values contained in the file”.
  • See article and full source code.
    .

Until Next Time!