Bltools V2.2 -
| Operation | v2.1 (single-thread) | v2.2 (parallel) | Improvement | |--------------------|----------------------|-----------------|--------------| | Filter + 5 rules | 28 min 40 sec | 6 min 12 sec | | | Format conversion | 18 min 22 sec | 4 min 05 sec | 4.5x | | Schema validation | 32 min 10 sec | 7 min 48 sec | 4.1x |
rules: - field: email validate: MATCHES_REGEX ^\S+@\S+\.\S+$ on_fail: reject - field: age validate: BETWEEN 0 AND 120 on_fail: default(18) Run: bltools v2.2
bltools transform --input weekly_data --state process.state --resume For reproducible pipelines, use the official bltools v2.2 container: | Operation | v2
In the fast-evolving landscape of data transformation, ETL (Extract, Transform, Load) processes, and business logic automation, staying updated with the right tools is crucial. For data engineers, analysts, and DevOps professionals who rely on lightweight, scriptable utilities, the release of bltools v2.2 marks a significant milestone. and business logic automation
bltools migrate --old-config ./rules_v1.yaml --new-config ./rules_v2.yaml Using a 50 GB CSV file with 500 million rows, on an 8-core/16-thread server:
#bltools #bltoolsV2 #DataEngineering #ETL #OpenSource