7.6.5. mclearn.preprocessing.csv_to_hdf

mclearn.preprocessing.csv_to_hdf(csv_path, no_files=1, hdf_path='store.h5', data_cols=None, expectedrows=7569900, min_itemsize=40, table_name='table')[source]

Convert csv files to a HDF5 table.

Parameters:
  • csv_path (str) – The path of the source csv files.
  • no_files (int) – The number of csv parts.
  • hdf_path (str) – The path of the output.
  • data_cols (array) – The names of the columns. Should be the same as the first line in the first csv file.
  • expectedrows (int) – The expected number of rows in the HDF5 table.
  • min_itemsize (int) – The minimum string size.
  • table_name (str) – The name of the HDF5 table.