Support for the Apache Parquet file format
Shadi Akiki
shadi at autofitcloud.com
Fri Nov 22 07:37:12 UTC 2019
Hello! Is there any work being done to support the Apache Parquet file
format?
The two data processing tools that I use locally with parquet are:
* python pandas [1] [2] for programmatic access
* visidata [3] (CLI that uses pandas under the hood) for more
"interactive" access
I'm wondering why Parquet is not yet a supported format in LibreOffice
Calc (and most desktop worksheet processing tools for that matter).
On an unrelated note, I was also surprised to find out that Tableau only
supports Parquet through a database server like Apache Drill [4][5].
I feel that parquet files are under-rated, and that perhaps pushing for
native desktop application support would encourage its usage over the
standard (and less efficient) CSV file format. I may be completely wrong
and would welcome feedback.
* [1]
https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.read_parquet.html
* [2]
https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.to_parquet.html
* [3] https://visidata.org/formats/
* [4] https://drill.apache.org/docs/tableau-examples/
* [5] https://drill.apache.org/docs/parquet-format/
--
Shadi Akiki
Founder & CEO, AutofitCloud
https://autofitcloud.com/
+1 813 579 4935
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/libreoffice/attachments/20191122/4a2bac72/attachment.html>
More information about the LibreOffice
mailing list