Methods

Summary

method

class parent

truncated documentation

__getitem__

StreamingDataFrame

Implements some of the functionalities pandas offers for the operator [].

__init__

StreamingDataFrame

__init__

JsonIterator2Stream

__init__

JsonPerRowsStream

__init__

StreamingInefficientException

This method is inefficient in streaming mode and not implemented.

__iter__

StreamingDataFrame

Iterator on a large file with a sliding window. Each windows is a DataFrame. The method stores a …

__iter__

JsonIterator2Stream

Iterate on each row.

_concath

StreamingDataFrame

_concatv

StreamingDataFrame

_reservoir_sampling

StreamingDataFrame

Uses the reservoir sampling algorithm to draw a random sample …

add_column

StreamingDataFrame

Implements some of the functionalities pandas offers for the operator [].

apply

StreamingDataFrame

Applies pandas.DataFrame.apply. This function returns a StreamingDataFrame.

applymap

StreamingDataFrame

Applies pandas.DataFrame.applymap. This function returns a StreamingDataFrame.

concat

StreamingDataFrame

Concatenates dataframes. The function ensures all pandas.DataFrame or StreamingDataFrame

ensure_dtype

StreamingDataFrame

Ensures the dataframe df has types indicated in dtypes. Changes it if not.

fillna

StreamingDataFrame

Replaces the missing values, calls pandas.DataFrame.fillna.

get_kwargs

StreamingDataFrame

Returns the parameters used to call the constructor.

getvalue

JsonPerRowsStream

Returns the whole stream content.

groupby

StreamingDataFrame

Implements the streaming pandas.DataFrame.groupby. We assume the result holds in memory. The out-of-memory …

groupby_streaming

StreamingDataFrame

Implements the streaming pandas.DataFrame.groupby. We assume the result holds in memory. The out-of-memory …

head

StreamingDataFrame

Returns the first rows as a DataFrame.

is_stable

StreamingDataFrame

Tells if the dataframe is supposed to be stable.

iterrows

StreamingDataFrame

See pandas.DataFrame.iterrows.

merge

StreamingDataFrame

Merges two StreamingDataFrame and returns StreamingDataFrame. right can be either a StreamingDataFrame

read

JsonIterator2Stream

Reads the next item and returns it as a string.

read

JsonPerRowsStream

Reads characters, adds ,, [, ] if needed. So the number of read characters is not recessarily …

readline

JsonPerRowsStream

Reads a line, adds ,, [, ] if needed. So the number of read characters is not recessarily the …

sample

StreamingDataFrame

See pandas.DataFrame.sample. Only frac is available, otherwise choose reservoir_sampling(). …

sort_values

StreamingDataFrame

Not implemented.

tail

StreamingDataFrame

Returns the last rows as a DataFrame. The size of chunks must be greater than n to get n

to_csv

StreamingDataFrame

Saves the DataFrame into string. See pandas.DataFrame.to_csv.

to_dataframe

StreamingDataFrame

Converts everything into a single DataFrame.

to_df

StreamingDataFrame

Converts everything into a single DataFrame.

train_test_split

StreamingDataFrame

Randomly splits a dataframe into smaller pieces. The function returns streams of file names. It …

where

StreamingDataFrame

Applies pandas.DataFrame.where. inplace must be False. This function returns a StreamingDataFrame. …

write

JsonIterator2Stream

The class does not write.