Skip to main content

fork

Given some INPUT data, it generates an OUTPUT data file (.txt, .csv, .json), given any number of optional data augmentation operations. This command is most useful as a playground to test and experiment with data augmentation pipelines before training a model, and as a shortcut for converting datasets from one file type to another.

Usage

hxmx fork <INPUT> [OUTPUT] [-op <TEXT>] [-s <INTEGER>] [-d <CHOICE>] [--debug] [--help]

Arguments

NameTypeRequiredDefault
INPUTpath — must exist, file
OUTPUTpath — fileoutput.csv

Options

NameTypeDefaultDescription
--operations, -opstring (multiple)[]Data augmentation operation(s) to stochastically apply during training. See operations.
--seed, -sinteger0Random seed. Use 0 for non-deterministic results.
--device, -dchoice (auto|cpu| ...)autoComputing device. To list all available devices, run hxmx devices.
--debugbooleanFalseDebug mode. Includes traceback when an error is raised.
-h, --helpbooleanFalseOpen documentation in browser.