![4. Spark SQL and DataFrames: Introduction to Built-in Data Sources - Learning Spark, 2nd Edition [Book]](https://www.oreilly.com/api/v2/epubs/9781492050032/files/assets/lesp_0401.png)
4. Spark SQL and DataFrames: Introduction to Built-in Data Sources - Learning Spark, 2nd Edition [Book]
how to read from HDFS multiple parquet files with spark.index.create .mode("overwrite").indexBy($"cellid").parquet · Issue #95 · lightcopy/parquet-index · GitHub
From Spark-Scala it is not possible to read a parquet file created with pyarrow · Issue #1470 · apache/arrow · GitHub
![amazon s3 - Spark input size when reading Parquet from S3 is 2x higher than from local FS - Stack Overflow](https://i.stack.imgur.com/ZHM4E.png)