eh corpus¶
Manage training corpora — sync source files from TOML manifests.
Usage¶
Corpus manifests¶
Manifests are TOML files describing where to find source code:
[corpus]
name = "elite-python"
language = "python"
class = "A"
[[corpus.sources]]
type = "github"
repo = "org/repo"
branch = "main"
paths = ["src/"]
[[corpus.sources]]
type = "local"
path = "/path/to/code"
Example¶
Downloads/copies source files to corpus/elite-python/ for training.