Python 使用 h5py 删除 hdf5 数据集

Question

提问by hsnee

Is there any way to remove a dataset from an hdf5 file, preferably using h5py? Or alternatively, is it possible to overwrite a dataset while keeping the other datasets intact?

有没有办法从 hdf5 文件中删除数据集，最好使用 h5py？或者，是否可以覆盖一个数据集同时保持其他数据集完好无损？

To my understanding, h5py can read/write hdf5 files in 5 modes

据我了解，h5py 可以在 5 种模式下读/写 hdf5 文件

f = h5py.File("filename.hdf5",'mode')

where mode can be rfor read, r+for read-write, afor read-write but creates a new file if it doesn't exist, wfor write/overwrite, and w-which is same as wbut fails if file already exists. I have tried all but none seem to work.

其中，模式可以是r读，r+读，写，a读，写，但如果创建它不存在，一个新的文件，w用于写/改写， w-这是一样的w，但如果文件已经存在失败。我已经尝试了所有但似乎没有工作。

Any suggestions are much appreciated.

任何建议都非常感谢。

Answer 1

采纳答案by EnemyBagJones

Yes, this can be done.

是的，这是可以做到的。

with h5py.File(input,  "a") as f:
    del f[datasetname]

You will need to have the file open in a writeable mode, for example append (as above) or write.

您需要以可写模式打开文件，例如追加（如上）或写入。

As noted by @seppo-enarvi in the comments the purpose of the previously recommendedf.__delitem__(datasetname)function is to implement thedeloperator, so that one can delete a dataset usingdel f[datasetname]

正如@seppo-enarvi 在评论中所指出的，先前推荐的f.__delitem__(datasetname)函数的目的是实现del运算符，以便可以使用del f[datasetname]

Answer 2

回答by agomcas

I do not understand what has your question to do with the file open modes. For read/write r+ is the way to go.

我不明白您的问题与文件打开模式有什么关系。对于读/写 r+ 是要走的路。

To my knowledge, removing is not easy/possible, in particular no matter what you do the file size will not shrink.

据我所知，删除并不容易/不可能，特别是无论你做什么，文件大小都不会缩小。

But overwriting content is no problem

但是覆盖内容没问题

f['mydataset'][:] = 0

Answer 3

回答by Felix

I tried this out and the only way I could actually reduce the size of the file is by copying everything to a new file and just leaving out the dataset I was not interested in:

我试过了，我实际上可以减小文件大小的唯一方法是将所有内容复制到一个新文件中，而只留下我不感兴趣的数据集：

fs = h5py.File('WFA.h5', 'r')
fd = h5py.File('WFA_red.h5', 'w')
for a in fs.attrs:
    fd.attrs[a] = fs.attrs[a]
for d in fs:
    if not 'SFS_TRANSITION' in d: fs.copy(d, fd)

Python 使用 h5py 删除 hdf5 数据集

提问by hsnee

采纳答案by EnemyBagJones

回答by agomcas

回答by Felix

相关推荐

最近更新

标签

Python 使用 h5py 删除 hdf5 数据集

提问by hsnee

采纳答案by EnemyBagJones

回答by agomcas

回答by Felix

相关推荐

NLTK 命名实体识别到 Python 列表

打开不同目录下的所有文件python

Python Pyinstaller 和 --onefile：如何在 exe 文件中包含图像

python - 如何将可变数量的参数格式化为字符串？

相关推荐

最近更新

标签