scio

Seamless Scala Integration with Apache Beam and Google Cloud Dataflow

Product Description

Scio is a Scala-based API designed for efficient integration with Apache Beam and Google Cloud Dataflow. Drawing inspiration from Apache Spark and Scalding, it supports both batch and streaming models and offers extensive compatibility with Google Cloud services like BigQuery and Pub/Sub, alongside other tools such as Avro and TensorFlow IOs. Scio empowers developers with type-safe queries and powerful data orchestration capabilities, making distributed data processing seamless and accessible.
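
To give a feel for the Spark- and Scalding-like style described above, here is a minimal word-count sketch. It assumes the `scio-core` dependency is on the classpath and a recent Scio release (0.8 or later, where `sc.run()` is used to launch the pipeline). The object name `WordCount`, the `tokenize` helper, and the `--input` / `--output` argument names are illustrative choices, not something this listing specifies.

```scala
import com.spotify.scio._

object WordCount {
  // Hypothetical helper for this sketch: split a line into non-empty words.
  def tokenize(line: String): Seq[String] =
    line.split("[^a-zA-Z']+").filter(_.nonEmpty).toSeq

  def main(cmdlineArgs: Array[String]): Unit = {
    // Parse Beam pipeline options plus custom arguments (assumed: --input, --output).
    val (sc, args) = ContextAndArgs(cmdlineArgs)

    sc.textFile(args("input"))                        // read lines as an SCollection[String]
      .flatMap(line => tokenize(line))                // one element per word
      .countByValue                                   // (word, number of occurrences)
      .map { case (word, count) => s"$word: $count" } // format each pair as text
      .saveAsTextFile(args("output"))                 // write sharded text output

    // Submit the pipeline to the runner selected by the pipeline options.
    sc.run()
  }
}
```

The same code can run locally or on Google Cloud Dataflow depending on the runner passed in the command-line options, since the runner is chosen outside the pipeline logic.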
Project Details