About 6,240,000 results
Open links in new tab
  1. file - What are the pros and cons of the Apache Parquet format …

    Apr 24, 2016 · Some characteristics of Apache Parquet are: Self-describing Columnar format Language-independent In comparison to Apache Avro, Sequence Files, RC File etc. I want an …

  2. Reading / Fixing a corrupt parquet file - Stack Overflow

    Sep 3, 2024 · Either the file is corrupted or this is not a parquet file. when I tried to construct a ParquetFile instance. I assume appending PAR1 to the end of the file could help this?

  3. How to read a Parquet file into Pandas DataFrame?

    How to read a modestly sized Parquet data-set into an in-memory Pandas DataFrame without setting up a cluster computing infrastructure such as Hadoop or Spark? This is only a …

  4. Python: save pandas data frame to parquet file - Stack Overflow

    Jan 29, 2019 · Is it possible to save a pandas data frame directly to a parquet file? If not, what would be the suggested process? The aim is to be able to send the parquet file to another …

  5. How to append new data to an existing parquet file?

    Apr 27, 2023 · I have parquet files with some data in them. I want to add more data to them frequently every day. I want to do this without having to load the object to memory and then …

  6. Spark parquet partitioning : Large number of files

    Jun 28, 2017 · Spark 2.2+ From Spark 2.2 on, you can also play with the new option maxRecordsPerFile to limit the number of records per file if you have too large files. You will …

  7. How to append data to an existing parquet file - Stack Overflow

    Aug 31, 2016 · Write multiple parquet files. Then combine them at a later stage. Write multiple parquet files. The tool you are using to read the parquet files may support reading multiple files …

  8. How to read partitioned parquet file into polars? - Stack Overflow

    Apr 24, 2023 · Note that the polars native scan_parquet now directly supports reading hive partitioned data from cloud providers, and it will use the available statistics/metadata to …

  9. What file extension is the correct way to name parquet files?

    Jun 3, 2025 · 0 According to .parquet is the official extension, they don't mention variations for the various compression methods used (which is exposed in the file itself). Might also be noted …

  10. io - Read / Write Parquet files without reading into memory (using ...

    Aug 17, 2021 · Open a parquet file for reading. Then use iter_batches to read back chunks of rows incrementally (you can also pass specific columns you want to read from the file to save …