In the pipeline: September 2023
Juan Luis Cano Rodríguez
Posted on September 6, 2023
This month: a roundup of the summer’s Kedro news, some release updates, and our top picks from recent articles.
Kedro team news
Over the last few months, we’ve been happy to welcome some new team members to the Kedro and Kedro-Viz teams, who have also joined our Technical Steering Committee. Welcome Dmitry Sorokin, Jitendra Gundaniya, Laura Couto, Ravi Kumar Pilla, and Vladimir Nikolic!
We are also pleased to announce a Kedro baby, delivered safely by one of the team, at the end of July!
Contributors news
We reworked the Kedro contributors guide in August, and moved it to the Kedro wiki. There are loads of different ways to contribute to Kedro and if you want to get involved, we encourage you to look at the table that introduces the Kedro contributor guide.
If you spot an article, podcast or video that discusses Kedro, you can also contribute by adding it to the “Awesome Kedro” repository, or letting us know on Slack.
There have been some amazing contributions in recent weeks, including the kedro-vineyard plugin for efficient intermediate sharing in Kedro pipelines, kedro-graphql for serving Kedro projects as a GraphQL API, and kedro-pandera to bring data validation to your Kedro projects.
Release news
August 2023 saw a set of releases to introduce Python 3.11 support across Kedro, Kedro-Viz and Kedro datasets.
Kedro version 0.18.13 included these major features and improvements:
Added support for Python 3.11.
Added new OmegaConfigLoader features: registering of custom resolvers through CONFIG_LOADER_ARGS and support for global variables.
Added kedro catalog resolve CLI command that resolves dataset factories in the catalog with any explicit entries in the project pipeline.
Simplified the conf folder structure for modular pipelines and updated kedro pipeline create and kedro catalog create accordingly.
Made various updates to the Kedro project template and Kedro starters: use of OmegaConfigLoader, transition from setup.py to pyproject.toml, and updated for the simplified conf structure.
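To make the idea of resolving dataset factories a little more concrete, here is a minimal sketch that uses only the Python standard library. It is not Kedro's implementation and does not call any Kedro API: the helper names (pattern_to_regex, fill, resolve), the "{name}_csv" pattern, and the example catalog entries are all assumptions made up for illustration. The sketch shows explicit catalog entries taking priority, with a factory pattern filling in any remaining dataset names that match it.

```python
import re


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Turn a factory pattern such as "{name}_csv" into a compiled regex."""
    regex = ""
    # re.split with a capturing group keeps the "{placeholder}" pieces.
    for part in re.split(r"(\{\w+\})", pattern):
        if part.startswith("{") and part.endswith("}"):
            regex += f"(?P<{part[1:-1]}>.+)"  # named group per placeholder
        else:
            regex += re.escape(part)  # literal text must match exactly
    return re.compile(regex)


def fill(template, values):
    """Recursively substitute placeholder values into a config entry."""
    if isinstance(template, dict):
        return {key: fill(value, values) for key, value in template.items()}
    if isinstance(template, str):
        return template.format(**values)
    return template


def resolve(dataset_name, explicit, factories):
    """Return the config for a dataset name, or None if nothing matches."""
    if dataset_name in explicit:  # explicit entries win over patterns
        return explicit[dataset_name]
    for pattern, template in factories.items():
        match = pattern_to_regex(pattern).fullmatch(dataset_name)
        if match:
            return fill(template, match.groupdict())
    return None


# Hypothetical catalog content, for illustration only.
explicit = {
    "companies": {
        "type": "pandas.CSVDataSet",
        "filepath": "data/01_raw/companies.csv",
    },
}
factories = {
    "{name}_csv": {
        "type": "pandas.CSVDataSet",
        "filepath": "data/02_intermediate/{name}.csv",
    },
}

# Dataset names as they might appear in a project pipeline.
pipeline_datasets = ["companies", "reviews_csv", "model_input"]
resolved = {
    name: resolve(name, explicit, factories) for name in pipeline_datasets
}
```

In this sketch, "reviews_csv" picks up its file path from the pattern, "companies" keeps its explicit entry, and "model_input" matches nothing, so it resolves to None.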
Kedro-Viz version 6.5 added support for Python 3.11, while Kedro-Viz version 6.4 added two new features: feature hint cards that highlight key features of Kedro-Viz, and support for displaying dataset statistics in the metadata panel for further investigation.
Kedro Datasets version 1.7.0 added polars.GenericDataSet, a dataset backed by polars, a lightning-fast dataframe package built entirely using Rust. Kedro Datasets version 1.6.0 added support for Python 3.11.
Recently on the Kedro blog
In the last few weeks we’ve published the following on the Kedro blog:
We’re always looking for collaborators to write about their experiences using Kedro, particularly if you’re working with Kedro datasets or converting an existing project to use Kedro. Get in touch with us on our Slack workspace to tell us your story.
What we’ve learned
We really enjoyed reading more on Medium about the Kedro Vineyard plugin, which uses a cloud-native data manager to share data in memory between the steps of data science pipelines on Kubernetes.
Quix published an interesting article called “Bridging the gap between data scientists and engineers in machine learning workflows”, a topic we regularly discuss within the team.
We found a super-interesting project about font recognition that uses Kedro.
And finally, we enjoyed reading more about data streaming with Kedro over on the QuantumBlack Medium channel.
That’s it for this edition!