Inputs / Outputs

Dataframes / Numpy arrays

HDF5 is easy to manipulate in the Python world but difficult to exchange with other people and other environments. The two following functions makes it easier to collapse many dataframes or numpy arrays into one single file. The data can be unzipped afterwards.

pandas_streaming.df.read_zip (zipfilename, zname = None, kwargs)

Reads a dataframe from a zip file. It can be saved by read_zip.

pandas_streaming.df.to_zip (df, zipfilename, zname = ‘df.csv’, kwargs)

Saves a Dataframe into a zip file. It can be read by to_zip.