forensics features migrate
Upgrade every feature parquet to the current schema version.
Real-corpus parquets store only article_id; URLs live in
articles.db. The migrator JOINs against that DB once per run to derive
section for every row. If the DB is missing, rows without a url
column fall back to section = "unknown" (with a WARNING per file).
forensics features migrate [OPTIONS]Options
Section titled “Options”| Option | Description |
|---|---|
--features-dir PATH | Override the features directory (default: <project_root>/data/features). |
--articles-db PATH | Override the SQLite DB used for the article_id -> url JOIN (default: <project_root>/data/articles.db). |
--dry-run | Log the would-be changes without touching any files. |
--help | Show this message and exit. |
Built by Abstract Data