Experimentation & Conversion Intelligence
Done with gut feel and client-side flickering. We build the server-side A/B test architecture and product analytics setup that grows your conversion rate without sacrificing Core Web Vitals.
- PostHog
- GrowthBook
- Statsig
- Kameleoon (Server-Side)
- Vercel Edge / Cloudflare Workers
- BigQuery (Frankfurt)
- GA4 Exploration
Google Optimize is dead. Client-side test frameworks inject variants after the first paint and break Core Web Vitals in the process. Adblockers filter exposures, ITP kills the bucketing cookie. Anyone still assigning in the browser in 2026 is testing against noise.
We move the assignment logic to the edge. Server-side bucketing before render, exposure logging in BigQuery, Bayesian or Frequentist significance depending on the question. The variant arrives fully rendered from the edge handler, no FOUC, no CLS jump.
kaputt
Client-side
- Browser loads control
- A/B snippet in
<head> - DOM mutation after first paint
- FOUC + CLS jump
- Adblocker filters ~25 % exposures
- ITP kills 7-day cookie
- Bucketing decays after 1 week
resilient
Datascale (edge)
- Request hits edge handler
- Hashed user ID → bucket
- Variant HTML served directly
- Zero CWV impact
- Exposure event → PostHog + BigQuery
- Stable bucketing (server state)
- GrowthBook / Statsig backend
Architecture: server-side assignment at the edge, before the first paint. Bucket cookie, exposure event and variant rendering happen atomically before the browser receives the HTML.
What we build
Most experimentation programmes don't fail on weak hypotheses. They fail on setup leaks. Mis-attributed exposures, filtered bucketing, prematurely stopped tests, the same errors repeat in every CRO audit we run.
Experimentation & Conversion Intelligence closes those leaks at the architecture, not in the evaluation spreadsheet. Edge-based assignment. Exposure logging with PII separation. Bayesian and Frequentist side by side, with the method choice documented.
The datascale difference: We're not a CRO agency that paints buttons red. We're engineers who build the pipeline that makes painting buttons red measurable in the first place.
Who it's for
E-commerce on a headless stack
Next.js, Astro, Remix or SST in production. Performance is a KPI, client-side test tools aren't an option because they ruin the Lighthouse scores that carry your SEO.
B2B SaaS with pricing and onboarding tests
Trial funnels, pricing-page variants and activation flows need statistically valid tests, not heatmap gut feel. Bucket consistency across login state is non-negotiable.
Paid-media teams with landing-page tests
Running ads and need landing-page variants that compete cleanly. Per-channel evaluation with correct multi-variant correction.
Post-relaunch teams
Watching regressions or unexpected drop-offs after the relaunch. We build the switchback testing setup that lets you A/B against the pre-relaunch state.
Continuous loop
- 01
Data Audits
Conversion funnel validated for data quality.
- Event validation against the Measurement Blueprint
- Funnel completeness, cross-device
- Bot and sampling correction
Tools: PostHog, GA4 Exploration, BigQuery
- 02
Hypothesis Engineering
Prioritised hypotheses from the funnel analysis.
- ICE score (Impact, Confidence, Ease)
- MDE and required runtime calculated upfront
- Method choice: Bayesian or Frequentist
Per test: hypothesis doc in the Notion backlog
- 03
Edge Deployment
Server-side assignment before the first paint.
- Middleware handler in Next.js, Vercel Edge or Cloudflare Workers
- Hashed bucketing, persistent across sessions
- Exposure event to backend and BigQuery in parallel
Stack: GrowthBook, Statsig, Kameleoon SS
- 04
Bayesian / Frequentist Analysis
Significance with a documented method choice.
- SRM check before evaluation (Sample Ratio Mismatch)
- Confidence interval + effect size, not a bare p-value
- Entry into the Learnings Library, backlog reprio
Output: test report with action recommendation
The process: continuous validation, not a one-off relaunch lottery. Step 04 writes into the Learnings Library and reprioritises the backlog, which feeds Step 01 with new data.
Deliverables
Scope varies by engagement type. Full engagement includes:
Conversion data audit (Audit Sprint)
- Cross-device funnel analysis with drop-off rates per step
- Prioritised hypothesis backlog (ICE score, MDE, required runtime)
- Bot and sampling correction, exposure validation
- Audit report with concrete test recommendations for the next 90 days
Edge architecture (Build Sprint)
- Middleware handler (Next.js / Vercel Edge / Cloudflare Workers)
- Flag-service integration (GrowthBook EU-self-hosted, or Statsig)
- Exposure event pipeline to backend and BigQuery
- Bucket consistency cross-domain, pre- and post-login
Test operations (Managed Evolution)
- Test setup with documented runtime and stop criteria
- SRM monitoring and sequential-testing spending
- Evaluation with confidence interval and effect size
- Documentation in the Learnings Library
Reporting and handover
- Monthly test performance report
- Quarterly hypothesis reprio based on accumulated learnings
- Handover documentation for the internal team
- 30-day post-launch support
Engineering scope
Three parallel workstreams that together form a complete experimentation setup.
Analytics & Tracking
- Event-based measurement setup (PostHog, GA4)
- Funnel tracking cross-device, cross-domain
- Exposure logging in BigQuery (Frankfurt)
- Consent Mode V2 integration for exposure PII separation
Test Engineering
- Edge-based assignment setup (Vercel Edge, Cloudflare Workers, Next.js middleware)
- Feature-flag service via Statsig or GrowthBook (EU-self-hosted)
- Bucket persistence across session and login boundaries
- Kameleoon server-side or VWO server-mode for enterprise contracts
Data Science
- Cohort analysis by source, device, segment, LTV stratum
- Significance reporting (Bayesian + Frequentist side by side)
- SRM check, sequential-testing spending, multi-variant correction
- BigQuery ML models for pre-test stratification
Engagement depths
Three depths. Clear scopes.
No retainer trap.
Audit Sprint
We audit what's wrong. Report + prioritised action plan.
plus statutory VAT · fixed price for a clearly bounded scope
Included in the fixed price
- 1 domain
- 1 analytics property
- 1 tag manager / tracking setup
- 1 CMP
- up to 5 core conversions
- 10 working days
- PDF report + 90-min walkthrough
What you get
- Full analysis of your existing setup
- Prioritised report with concrete action items
- Walkthrough call with the team (90 min)
- No follow-up contract, no retainer obligation
When it fits
When the setup works but the numbers are being argued about internally. Or you're unsure what from a UA→GA4 migration still holds.
For e-commerce, multiple domains or App + Web: Audit Sprint Plus, €3,900 net fixed price. Bonus: 50 % of the Audit Sprint credits toward a Build Sprint commissioned within 30 days.
Request an Audit SprintBuild Sprint
Fresh build or restructure of a tracking setup.
plus statutory VAT · final fixed price after scope definition
Typical scope
- 1 domain (multi-domain on request)
- 1 analytics property (GA4 or Piwik PRO)
- server-side container (Stape or own cloud)
- 1 CMP with Consent Mode V2
- up to 15 events / conversions
- 4–8 weeks delivery
- Blueprint, QA sign-off, handover docs
What you get
- Measurement Blueprint for your dev team
- GTM + server-side setup incl. CMP integration
- Full QA against the blueprint with sign-off
- Handover docs + 30-day post-launch support
When it fits
When analytics is structurally broken and fixing it in-flight costs more than a clean rebuild.
Managed Evolution
Ongoing partnership. Analytics as a product, not a one-off project.
plus statutory VAT · monthly cancellation after the minimum term
Included in the monthly price
- up to 3 domains under active care
- GA4 + server-side stack maintenance
- monthly roadmap + sprint planning
- QA on every release deploy
- Slack channel, < 4 h response (Mon–Fri)
- monthly report + executive summary
- 3-month minimum, then monthly
What you get
- Monthly development + feature rollouts
- Ongoing QA on every deploy
- Executive reports + dashboard evolution
- Slack support with guaranteed response times
When it fits
When analytics has to grow with you (new campaigns, new products, new data sources) and you don't want to build that team internally.
All prices net, plus statutory VAT. For companies in Germany, Austria and Switzerland.
-
Why is server-side A/B testing mandatory in 2026?
Three forces make classic client-side testing untenable. First, Safari ITP 2.3 caps client-side cookies at 7 days, a bucketing state that decays after a week breaks every test. Second, ML-based adblockers filter 25 to 40 percent of exposure events, the power analysis runs against invisible sampling. Third, client-side variants force a FOUC before the snippet swaps the variant, measurably damaging CLS and LCP in Core Web Vitals. Server-side assignment at the edge avoids all three: bucketing happens on the server, exposure is logged in the backend, the browser receives the final variant without flicker.
-
Bayesian vs. Frequentist: which method do you use?
Both, depending on the question. Frequentist (Welch t-test, Mann-Whitney U) yields classical p-values and MDE power analysis, suitable for regulated tests and hard go/no-go decisions before launch. Bayesian (native in GrowthBook or Statsig) gives a direct probability statement ("Variant B is better with 94 percent probability") and allows sequential testing without alpha inflation, useful when the business case is asymmetric and peeking has to be allowed. We pick the approach during the hypothesis kickoff, not afterwards.
-
How do you integrate testing into a Next.js / headless stack?
Assignment logic runs in Next.js middleware or a Cloudflare Worker. Concretely: the request hits the edge layer, we hash the user ID (anonymous or consent-bound PII) against the test definition, write the bucket cookie, and route to the assigned variant. The exposure event goes to PostHog or GrowthBook in parallel, and to BigQuery for evaluation. Variant code stays with your dev team, we ship the edge handler, flag service and exposure SDK wiring. Works on Vercel, Cloudflare Pages, Netlify Edge and SST.
-
How do you prevent client-side flickering during tests?
Through architecture, not trickery. Client-side test frameworks like Google Optimize (sunset 2023), VWO Visual Editor or Optimizely Web inject CSS and DOM mutations after the first paint, producing Flash of Unstyled Content and a measurable CLS jump. With server-side assignment at the edge the browser receives the HTML of the assigned variant directly. No paint cycle, no mutation observer, no layout shift. Before rollout we document the CLS delta between control and variant and stop the test if the variant shows performance regression.
-
How much traffic does a site need for A/B tests?
Rule of thumb: at least 1,000 conversions per test variant over 4 weeks. Below that the test runs longer than the hypothesis stays valid, or returns no significance. With lower traffic we use qualitative methods, the obvious wins from the conversion audit and switchback designs at the marketing-channel level. We don't recommend tests that aren't structurally measurable, that's burning budget.
-
What is a Learnings Library?
A documented record of every test, won or lost. Each entry contains: hypothesis, test design, MDE, runtime, result with confidence interval, derived learnings. Prevents the same tests running twice, accelerates the next hypothesis by weeks. We host the library in Notion or Linear, ownership stays with you.
-
How long does an A/B test take?
At least 2 weeks to cover weekday and weekend effects. Typically 3 to 6 weeks to significance, depending on baseline conversion rate, MDE and traffic volume. We don't stop tests early because an interim result looks good. Peeking without alpha-spending destroys the statistical claim, one of the most common errors in CRO programmes.
-
What does an experimentation setup cost?
Audit Sprint at €2,400 net fixed price, 10 business days delivery. Includes conversion data audit, prioritised funnel leaks, test roadmap. Build Sprint for the edge architecture (middleware handler, flag service, exposure pipeline) from €7,500 net, depending on hosting setup. Managed Evolution monthly, from €4,000 net per month, covers hypothesis backlog, test operations and quarterly roll-up.
Next step
Where are you losing conversions today?
Audit Sprint at €2,400 net fixed price, 10 business days delivery. Conversion data audit, prioritised funnel leaks, test roadmap. No follow-on contract, no forced retainer.