What GitHub Copilot's AI Credits Switch Really Costs Your Engineering Team

GitHub's June 2026 billing change is live. Here's what it actually costs a 100–500 engineer org — and what to do before September when the promotional credits expire.

Share

A 500-engineer organization on GitHub Copilot Enterprise pays $234,000 per year before a single AI model call. On June 1, 2026, GitHub changed how additional usage is billed — and for teams running agentic workflows, that "additional usage" line item has the potential to exceed the subscription itself.

This post walks through what changed, why it's a budget management problem (not just a pricing update), and what engineering leaders should do before September, when the promotional credits expire and the real numbers arrive.


What Actually Changed

For most of GitHub Copilot's history, billing was straightforward: pay per seat, receive a monthly allotment of Premium Request Units (PRUs), pay overage if you exceeded it. PRUs were the unit of account — a rough proxy for model usage, tied to specific premium features like Claude Sonnet responses or extended context windows.

On June 1, 2026, GitHub retired PRUs and replaced them with AI Credits. The conversion is simple: 1 AI Credit = $0.01 USD.

The subscription prices didn't change: - Copilot Business: $19 per seat per month - Copilot Enterprise: $39 per seat per month

What changed is the structure of what's included and how overages accumulate.

Through August 2026, GitHub is running a promotional period: Business accounts receive $30 worth of AI Credits per seat per month included; Enterprise accounts receive $70 per seat per month. These promotional allotments are generous — most standard-usage teams will comfortably stay within them through the summer.

After August, the permanent base allotment has not been clearly published. GitHub has not disclosed the exact credit amount that will be included in each plan once the promotional period ends. That ambiguity is the core of the budget problem — and the reason engineering leaders need to model their exposure now, not in September.


The Predictability Problem

When your billing model shifts from "pay $39/seat, use what you use within reason" to "pay $39/seat plus variable credit consumption," you've moved from a fixed cost to a partially variable one. For a finance team or VP of Engineering managing a headcount-based budget, this is a structural change — not just a pricing update.

Three specific mechanics make this harder to manage than it appears:

1. The promo credits create a false sense of security. The $30/$70 promotional allotments are intentionally generous. Most teams won't exceed them this summer, which means many organizations won't discover their true run rate until September — after the promo ends, the permanent allotment kicks in, and the first overage invoice arrives. You have a window to model your exposure before the bill tells you. 2. Unused credits expire. If your team has a slow sprint and consumes fewer credits than their monthly allotment, those credits don't carry over. There's no rollover mechanism. This means a light July doesn't offset a heavy August — every month resets. 3. There's no per-engineer or per-team usage dashboard by default. GitHub's billing interface shows organization-level consumption. If you manage 12 teams across 200 engineers and want to know which team burned 40% of your credits this month, that information is not natively surfaced. Building that visibility requires custom tooling or a third-party platform.

VS Magazine captured the developer community's reaction directly: "you will get less but pay the same price." That's the developer's frame. The budget owner's frame is more precise: "we don't know what we'll pay, and we won't know until the invoice arrives."


Real Cost Scenarios

The following models three usage patterns across three org sizes. All figures use Enterprise tier pricing ($39/seat/mo) and treat the post-August base allotment as approximately 3,000 credits per seat per month — a conservative estimate based on the promotional ceiling. GitHub has not published the permanent figure; treat these as planning estimates, not guarantees.

Scenario definitions: - Conservative: Primarily inline completions and standard chat. Minimal agentic use. Typical of teams that adopted Copilot before 2025 and haven't meaningfully changed their workflow. - Moderate: Regular agentic use — daily PR review agents, occasional multi-file refactors, some workspace agents. Representative of teams actively embracing AI-native development. - Heavy agentic: Multiple autonomous coding sessions per engineer per day. Premium models (Claude Sonnet, GPT-4o) selected. Agents touching 10+ files per session, running in background loops.

| Scenario | Credits/eng/mo | Subscription (100 eng) | Subscription (200 eng) | Subscription (500 eng) | |---|---|---|---|---| | Conservative | ~1,500 ($15) | $3,900 + ~$0 overage | $7,800 + ~$0 overage | $19,500 + ~$0 overage | | Moderate | ~4,500 ($45) | $3,900 + ~$1,500/mo | $7,800 + ~$3,000/mo | $19,500 + ~$7,500/mo | | Heavy agentic | ~15,000–50,000 ($150–$500) | $3,900 + $12,000–$47,000/mo | $7,800 + $24,000–$94,000/mo | $19,500 + $60,000–$235,000/mo |

Overage calculated as credits consumed beyond 3,000/seat/mo × $0.01.

The range in the heavy agentic row is not a rounding error — it's a genuine range driven by how your engineers use the tool. A 200-person team could see monthly overage swing between $24,000 and $94,000 depending on agent behavior. That's a budget planning problem, not a line item.


The Agentic Multiplier: Why One Session Can Drain Your Monthly Budget

Standard Copilot usage — tab completion, single-file chat, inline suggestions — is relatively credit-efficient. A developer generating completions for an eight-hour day consumes a few hundred AI Credits in model calls. Well within any reasonable allotment.

Agentic workflows are different in kind, not just degree.

When an engineer runs Copilot Workspace on a multi-file feature, triggers a PR review agent, or chains a coding agent across multiple tool calls, the model:

- Receives full context from multiple files (large input token count) - Writes extensive plans, code, and tests in a single generation pass - Runs multiple sequential tool calls, each with its own context window reload - May loop or backtrack as it evaluates intermediate results

GitHub's documentation acknowledges that agentic workflows consume 5x to 20x more tokens than standard completions. In practice, a single afternoon session refactoring a complex module with a coding agent can consume as many credits as a full week of inline completions for the same engineer.

For most engineering organizations in mid-2026, this isn't theoretical. If your teams are using Copilot Workspace, the coding agent, GitHub Spark, or any autonomous PR/review tooling, you're already in the agentic usage range. The question isn't whether your credit consumption will increase — it's whether you'll see it coming.

One scenario worth modeling explicitly: a 200-engineer team where 20% of engineers (40 people) are early adopters running two agentic sessions per day. Each session averages 10,000 tokens. That's 40 engineers × 2 sessions × 22 workdays × 10,000 tokens — a meaningful share of the organization's monthly credit pool, driven by a small minority of users. Standard billing dashboards won't surface that pattern without custom queries.


What Engineering Leaders Should Do Right Now

You have until at least September before the promotional credits expire and your permanent run rate becomes visible. Use that window deliberately.

1. Pull your current credit consumption now. Log into your GitHub organization's billing page and export usage for May and June. Even if you're comfortably within the promotional allotment, look at the shape: is consumption evenly distributed across engineers, or are 10–20% of users driving a disproportionate share? That pattern, if present, will amplify as agentic tools scale. 2. Survey teams on agentic feature usage. Send a three-question note to your engineering team leads: Are your developers using Copilot Workspace? The coding agent? GitHub Spark? The answers will tell you whether you're in the conservative or heavy agentic category. This takes five minutes and could save significant budget uncertainty in Q3. 3. Model your post-August exposure. Take your June credit consumption per engineer and apply a 1.5× buffer as a planning assumption for September onward. If the resulting number multiplied by $0.01 makes you uncomfortable at your headcount, you're in the moderate-to-heavy range and should establish governance before August. 4. Enable billing alerts before they're needed. GitHub allows spending limits and alerts at the organization level. They are not enabled by default. Set a monthly alert at 75–80% of your expected allotment so you're notified before you hit overage, not after the invoice closes. 5. Establish a model selection policy. Different models within Copilot carry different credit costs per call. Claude Sonnet and GPT-4o cost measurably more per interaction than the default base model. If your developers have free model choice, a small number of power users selecting premium models on every request can meaningfully affect your total. A lightweight policy — "use the base model as default; escalate to premium models for specific task types" — reduces overage exposure without a significant developer experience trade-off for most use cases.


The Bigger Picture

GitHub's shift to AI Credits isn't an isolated event. Cursor, Claude Code, and the broader class of agentic coding tools are all moving toward usage-based billing where the subscription is a floor and actual cost is determined by how intensively agents run. This is the new structural reality of enterprise AI tool budgeting.

The engineering organizations that manage this well won't necessarily be the ones spending the least — they'll be the ones with visibility into where spend is actually going. Understanding that one team consumes 8× the credits per engineer as another isn't just a finance number; it's signal about adoption depth, agent utilization, and whether your AI tooling investment is generating value or accumulating in the background.

At Olumia, we built specifically for this visibility gap: tracking usage, cost, and utilization across AI coding tools — Copilot, Cursor, Codeium, and others — in a single view. For organizations managing multiple tools simultaneously, knowing what you're paying and who's generating that spend isn't optional infrastructure. It's how you avoid a September billing surprise.

One question worth sitting with regardless of what tooling you use: the per-seat model was designed for human-speed coding assistance. Agent-speed workflows — code running overnight, PR review bots processing every merge, background refactoring agents — operate at a fundamentally different consumption rate. The right budgeting model for that future probably isn't the same one GitHub inherited from its 2021 launch. Worth thinking through before that reality arrives mid-budget-cycle.