DeftaAndrei/Logo_Clasifier

README: Advanced Apache Parquet Data Analysis

Apache Parquet is a columnar storage format optimized for big data processing.

This repository contains a set of Python scripts and algorithms designed to efficiently interpret and analyze Parquet datasets.

Advantages of Using Parquet

Columnar Storage: Faster queries for specific fields, since only the requested columns are read
Compression: Typically smaller file size compared to CSV or JSON
Schema Evolution: Can handle changes in data structure
Efficient Filtering: Uses predicate pushdown for faster scans
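
To make the columnar storage and predicate pushdown points concrete, here is a minimal pure-Python sketch of the idea. It is a toy model and not Parquet's actual implementation or this repository's code: the names RowGroup, write_row_groups and scan, and the sample id/score/label records, are all hypothetical. It assumes data is split into row groups that store each column separately along with min/max statistics, so a scan can skip whole groups and read only the selected columns.

```python
from dataclasses import dataclass


@dataclass
class RowGroup:
    """Toy row group: one list per column, plus (min, max) stats per column."""
    columns: dict
    stats: dict


def write_row_groups(rows, row_group_size):
    """Turn row-oriented records into column-oriented row groups."""
    groups = []
    for start in range(0, len(rows), row_group_size):
        chunk = rows[start:start + row_group_size]
        # Store each field as its own contiguous list (columnar layout).
        columns = {key: [row[key] for row in chunk] for key in chunk[0]}
        # Record min/max once at write time so scans can consult them cheaply.
        stats = {key: (min(vals), max(vals)) for key, vals in columns.items()}
        groups.append(RowGroup(columns, stats))
    return groups


def scan(groups, select, filter_column, lower_bound):
    """Return the selected columns for rows where filter_column >= lower_bound.

    Row groups whose recorded max is below lower_bound are skipped entirely.
    """
    results = []
    skipped = 0
    for group in groups:
        _, col_max = group.stats[filter_column]
        if col_max < lower_bound:
            skipped += 1  # pruned using statistics only, no values read
            continue
        for i, value in enumerate(group.columns[filter_column]):
            if value >= lower_bound:
                # Only the requested columns are touched (column projection).
                results.append({name: group.columns[name][i] for name in select})
    return results, skipped


# Hypothetical sample data, for illustration only.
rows = [{"id": i, "score": i * 10, "label": f"logo_{i}"} for i in range(9)]
groups = write_row_groups(rows, row_group_size=3)
matches, skipped = scan(groups, select=["id", "label"],
                        filter_column="score", lower_bound=60)
print(matches)
print(skipped)
```

With real Parquet files the same two ideas are normally applied by the reading library rather than by hand: the caller names the columns and the filter, and the reader uses the file's stored statistics to decide which row groups to skip.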
