jason blahovec
open source · published libraries

Open source.

Idempotent, self-verifying BigQuery ingestion libraries — each a pip-installable CLI with a ColumnSpec-driven docs generator. Pick one for the deep dive.

open source
4 published · BigQuery ingestion libraries
statcast-bigqueryv0.4.0
Idempotent Statcast → BigQuery ingestion, with first-class docs for SQL/LLM agents and round-trip validation against Baseball Savant.
118 tests · Savant round-trip verification · resumable chunked backfill
PythonBigQuerypybaseballCLI
yfinance-bigqueryv0.1.0
Idempotent Yahoo Finance OHLCV → BigQuery ingestion across 5 intervals, with first-class docs for SQL/LLM agents and internal-consistency verification.
126 tests · 5 intervals · BigQuery-internal verification
PythonBigQueryyfinanceCLI
nhl-bigqueryv0.2.2
Idempotent NHL play-by-play (with on-ice arrays merged from shift-charts) → BigQuery ingestion, with first-class docs for SQL/LLM agents and verification against the NHL public API.
126 tests · 6 tables in lockstep · on-ice arrays from shift-charts
PythonBigQueryNHL APICLI
nhl-hut-bigqueryv0.1.1
Idempotent EA NHL Ultimate Team ratings → BigQuery snapshots (skater + goalie endpoints unified), with a cross-source resolver that links HUT cards to NHL player IDs.
56 tests · skater + goalie snapshots · normalized-name card↔player resolver
PythonBigQueryEA NHL HUTCLI