Data Engineer, Open Source Software enthusiast, Apache Software Foundation committer.
I develop in Python, Scala/Java, and some Rust. Most of my work is related to the Apache Spark / PySpark ecosystem and Data Engineering tools.
I'm a maintainer of the following projects:
- GraphFrames -- scalable graph algorithms on top of Apache Spark DataFrames.
- Apache GraphAr (incubating) -- universal "open-table" format for storing Property Graphs.
- graphframes-rs -- vertex-centric graph algorithms on top of Apache DataFusion.
- spark-fast-tests -- Apache Spark testing helpers and assertions (Scala).
- chispa -- Apache Spark testing helpers and assertions (Python).
- falsa -- CLI tool for generating datasets of the H2O benchmark. Written in Rust.
And various other projects.
Wakatime weekly stats:
Rust 6 hrs 48 mins ████████░░░░░░░░░░░░░░░░░ 31.54 %
Python 5 hrs 31 mins ██████▒░░░░░░░░░░░░░░░░░░ 25.64 %
Scala 5 hrs 27 mins ██████▒░░░░░░░░░░░░░░░░░░ 25.34 %
TOML 59 mins █░░░░░░░░░░░░░░░░░░░░░░░░ 04.60 %
Java 56 mins █░░░░░░░░░░░░░░░░░░░░░░░░ 04.33 %
For open source activities and collaborations, you can reach me at [email protected].
For any other activities and collaborations, you can reach me at my private email, [email protected].