http://carray.pytables.org/docs/manual/

obviously these also offer persistence as well but this is a another dependence

Comment From: heroxbd

The URL changed to,

http://www.pytables.org/usersguide/libref/homogenous_storage.html

I really like to have HDF5 read/write in CArray to/from Pandas.

For example:

ptdump tt-014662.h5
/ (RootGroup) ''
...
/t4 (CArray(12671,), shuffle, zlib(4)) ''

As of 0.17.1 read_hdf of Pandas cannot read CArray "/t4" in the file.

But PyTables (3.2.2) can read it with:

import tables as tb
x = tb.open_file("tt-014662.h5")
x.get_node("/t4")

/t4 (CArray(12671,), shuffle, zlib(4)) ''
  atom := Float32Atom(shape=(), dflt=0.0)
  maindim := 0
  flavor := 'numpy'
  byteorder := 'little'
  chunkshape := (1584,)

Comment From: jreback

this issue is about a different issue actually. more of a mem-mapped back thing.

only PyTables Table's are readable directly in pandas. You can just read other structures using exactly the code you show above. What is the problem?

Comment From: heroxbd

Thanks for your explanation and sorry for hijacking this issue.

I just expected Pandas to read CArray off-the-shelf. Maybe it is a good idea to document this limitation of read_hdf5 in http://pandas.pydata.org/pandas-docs/stable/io.html#io-hdf5.

Comment From: jreback

if u want to submit a pull request to the effect that pandas does not in general read PyTables / h5py formats (except for specific external compat) then that would be ok

Comment From: heroxbd

On my list, thanks.