Tracking AI-generated commits across all public GitHub repos. Updated monthly from GH Archive via BigQuery.
Methodology: We observe 5.19M Claude commits in Feb 2026 from public repos. Since 81.5% of GitHub activity is private, and assuming similar AI adoption in private repos, the total is estimated at 5.19M ÷ 0.185 ≈ 28M commits/month. This is speculative — enterprise adoption may be higher or lower than public repo rates. Copilot and Cursor leave no commit-level attribution markers and are excluded entirely.
Denominator: total push events from BigQuery (not total commits — each push averages ~3 commits). Diff size adjustment: AI commits have ~2.5x larger diffs than human commits (per published research), so the code volume share is estimated at 2.5x the raw commit share. Both lines shown. Copilot/Cursor are excluded (no attribution markers) — real AI share is higher.
Best fit by AIC: Exponential. The logistic ceiling converged to the upper bound (~1B), meaning the data does not yet show saturation. With only 14 months of observations and a data quality gap at month 10, we cannot distinguish exponential from early-stage logistic. This does not predict indefinite exponential growth — it means the inflection point has not yet been observed.
| Tool | Total Commits | Peak Month Repos | Peak Month Devs |
|---|
Data sources
Detection signals
Co-Authored-By: Claude ... @anthropic.com trailers, Generated with [Claude Code] body markers, noreply@anthropic.com author email.aider: prefix in commit messages, Co-Authored-By: aider trailers.Co-Authored-By: ... @openai.com, noreply@openai.com author email.devin-ai-integration actor/author in BigQuery. Nov+: co-authored-by devin-ai via GitHub Search API (author-based search returned 0; co-author trailer search works). Detection method differs between periods.Known limitations
Model fitting
scipy.optimize.curve_fit (nonlinear least squares): Exponential (2 params), Logistic (3 params), Gompertz (3 params).