Data
Data#
- class torchmimic.data.DecompensationDataset(root, train=True, n_samples=None)
Decompensation dataset that can be directly used by PyTorch dataloaders. This class preprocessing the data the same way as “Multitask learning and benchmarking with clinical time series data”: https://github.com/YerevaNN/mimic3-benchmarks
- Parameters
root (str) – directory where data is located
train (bool) – if true, the training split of the data will be used. Otherwise, the validation dataset will be used
n_samples – number of samples to use. If None, all the data is used
- class torchmimic.data.IHMDataset(root, train=True, n_samples=None)
In-Hospital-Mortality dataset that can be directly used by PyTorch dataloaders. This class preprocessing the data the same way as “Multitask learning and benchmarking with clinical time series data”: https://github.com/YerevaNN/mimic3-benchmarks
- Parameters
root (str) – directory where data is located
train (bool) – if true, the training split of the data will be used. Otherwise, the validation dataset will be used
n_samples – number of samples to use. If None, all the data is used
- class torchmimic.data.LOSDataset(root, train=True, partition=10, n_samples=None)
Length-of-Stay dataset that can be directly used by PyTorch dataloaders. This class preprocessing the data the same way as “Multitask learning and benchmarking with clinical time series data”: https://github.com/YerevaNN/mimic3-benchmarks
- Parameters
root (str) – directory where data is located
train (bool) – if true, the training split of the data will be used. Otherwise, the validation dataset will be used
partition – number of patitions to use for binning
n_samples – number of samples to use. If None, all the data is used
- class torchmimic.data.PhenotypingDataset(root, train=True, n_samples=None)
Phenotyping dataset that can be directly used by PyTorch dataloaders. This class preprocessing the data the same way as “Multitask learning and benchmarking with clinical time series data”: https://github.com/YerevaNN/mimic3-benchmarks
- Parameters
root (str) – directory where data is located
train (bool) – if true, the training split of the data will be used. Otherwise, the validation dataset will be used
n_samples – number of samples to use. If None, all the data is used