We A/B Tested 12 Payment Configurations. Here's What Actually Moved the Needle

Most payment optimization decisions are made on incomplete data. A provider underperforms in one market, so the team shifts volume manually. A competitor claims better approval rates, so someone opens a new integration. A threshold gets adjusted after a bad week. None of it is tested. None of it is controlled. And the compounding cost of those guesses, across hundreds of thousands of transactions, is revenue that never appears on any report because no one tracks what was left on the table.
The right way to optimize is to test payment flows in production, systematically, against real traffic, with defined success criteria. This guide covers exactly that: what configurations to test, how to run them safely, and which of the twelve we ran actually moved the needle.
Why Testing Payment Configurations in Production Matters
Staging environments do not replicate live issuer behavior. A test card that approves cleanly in a sandbox may behave differently when routed through a real issuer in Germany or the Philippines at 11pm on a Friday. Production is the only environment where approval rates, latency, and cost reflect reality.
The problem is risk. Pushing an untested routing change to 100% of live traffic is how approval rates drop three percentage points before anyone notices. By the time the alert fires, or the Monday report surfaces the dip, days of revenue have already been lost.
Split routing solves this. It lets you send a defined percentage of live traffic to a new configuration while the remainder continues on the existing path. You get real transaction data, real issuer responses, and real cost signals, without exposing your full volume to an unproven change. Once a configuration proves out, you shift traffic. If it underperforms, you revert with a few clicks.
How to Structure a Payment Flow A/B Test in Production
Start with a clear hypothesis
Every test needs one variable and one success metric. Testing two variables simultaneously makes it impossible to attribute the result. A clean hypothesis looks like: "Routing Visa transactions originating from Indian BINs through Provider B instead of Provider A will increase approval rates by at least two percentage points."
The success metric should be specific: approval rate, authorization cost per transaction, or checkout-to-payment latency. Pick one. Secondary metrics can inform future tests.
Define your traffic split before you launch
Start conservative. A 90/10 split sends 10% of volume to the new configuration and limits downside exposure. Once the new configuration shows stable or improving results over three to five days, move to 70/30, then 50/50 if you want a cleaner statistical comparison. For lower-volume merchants, a 50/50 split from day one may be necessary to reach significance faster.
Do not run tests during promotional peaks or major sales events. Traffic composition changes during these periods, and results will not generalize to normal operating conditions.
Run for long enough to mean something
Seven to fourteen days of live traffic is the minimum for most configurations. Shorter tests produce noisy results that can lead teams to the wrong conclusion. If your transaction volume is high, seven days is often enough. If volume is moderate, extend to fourteen. Anything shorter is instinct with a confidence interval attached.
Segment your analysis by condition
An aggregate approval rate improvement may hide a loss in a specific segment. Always break down results by card brand, issuer country, currency, and transaction size before declaring a winner. A configuration that lifts Mastercard approvals in Europe by four points while reducing Visa approvals in Southeast Asia by two points is not a net win without careful math.
The 12 Configurations We Tested: What Moved the Needle
These tests were run across multiple merchant accounts processing at scale, using Yuno's split routing capability. Each test isolated a single variable. Results reflect live production traffic.
1. BIN-level routing by issuer country
Routing transactions to the provider with the strongest issuer relationship in the cardholder's country of origin produced consistent approval rate lifts. This was the highest-impact single variable we tested, with improvements ranging from two to five percentage points depending on the market. Merchants routing all traffic to a single provider regardless of BIN origin were consistently leaving approvals on the table.
2. Currency matching at the routing layer
Routing transactions in local currency to providers with local acquiring in that currency reduced decline rates caused by currency conversion friction. Markets where this had the largest effect included India, Nigeria, and Poland, where cross-border transaction decline rates from international providers were materially higher than local acquiring rates.
3. Card brand routing (Visa vs. Mastercard vs. local schemes)
Some providers have materially stronger approval rates for specific card brands. Routing Visa and Mastercard transactions to different primary providers, based on each provider's historical performance by brand, produced a measurable uplift. The effect was most pronounced for local card schemes: routing RuPay in India and Bancontact in Belgium to scheme-optimized providers closed a gap that generic routing missed entirely.
4. Time-of-day routing adjustments
Approval rates for certain providers drop during peak processing hours due to gateway capacity constraints. Routing away from congested providers between 12pm and 3pm local time, to providers with excess capacity during those windows, recovered transactions that would otherwise have declined with a generic processor error. The effect was small in aggregate but consistent across markets with high midday transaction volumes.
5. Transaction size banding
High-value transactions face more aggressive fraud scoring from some providers. Routing transactions above a defined threshold to providers with more permissive risk models, verified against fraud outcome data, lifted approval rates on large-ticket purchases without increasing chargeback rates. The key is validating that the permissive provider has comparable fraud outcomes, not just higher approvals.
6. Retry logic with a different provider on soft decline
This was the second-highest-impact configuration. When a transaction returns a soft decline from the primary provider, automatically retrying through a secondary provider recovers a meaningful share of those transactions. Merchants using Yuno's fallback routing recover an average of 8% of transactions that would otherwise have failed. The critical variable is the retry provider: retrying with the same provider on a soft decline produces near-zero recovery.
7. Payment method expansion by market
Adding a locally preferred payment method as a checkout option, then measuring its adoption and approval rate against card-only flows, consistently showed higher conversion in markets where that method dominates. In the Netherlands, adding iDEAL as the default option for Dutch users lifted checkout completion rates. In Kenya, adding M-Pesa removed a friction point that card-only flows could not address. The test here is not routing logic but checkout composition.
8. Tokenization vs. raw card on recurring transactions
Recurring transactions routed using network tokens had higher approval rates than those sent with raw card data, particularly after card reissue events. Tokens survive card number changes in many networks, meaning a subscription renewal that would have declined on a reissued card instead approves. This was particularly impactful for merchants with high recurring transaction volumes in North America and Europe.
9. 3DS routing by risk score
Applying 3DS selectively, rather than universally, based on transaction risk score reduced checkout friction on low-risk transactions without materially increasing fraud exposure. The test compared universal 3DS against risk-tiered 3DS and found that the risk-tiered approach maintained fraud rates within acceptable thresholds while reducing step-up friction on the majority of transactions.
10. Cost-optimized routing for low-margin transaction types
Routing low-margin transaction categories to the lowest-cost provider, rather than the highest-approval-rate provider, improved net revenue per transaction. This requires defining a minimum acceptable approval rate floor and routing to the cheapest provider that meets it. The configuration is simple in principle but requires clean cost data per provider, which many merchants do not have in a single view.
11. Geo-based failover to regional providers
When a primary provider shows latency spikes or elevated error rates in a specific region, automatically routing that region's traffic to a regional backup provider maintained approval rates during incidents that would otherwise have caused transaction failures. This is less an optimization test than an infrastructure resilience test, but the revenue impact during provider outages is significant.
12. Checkout field optimization by market
Reducing the number of required checkout fields for markets where certain data points are not used in issuer authorization decisions reduced form abandonment without changing downstream approval rates. Removing the billing address field in markets where issuers do not use AVS checks eliminated a friction point that had no approval rate benefit. This is a checkout UX test, not a routing test, but it belongs in any payment optimization program.
What Did Not Move the Needle
Three configurations produced no statistically significant improvement and are worth flagging to save time.
Routing by device type (mobile vs. desktop) showed no consistent approval rate difference once BIN and card brand were already accounted for. Time-zone-based routing, separate from time-of-day routing, produced no measurable effect. And rotating between providers purely on a percentage basis, with no condition logic attached, produced random variation rather than improvement. Uninformed splits are not optimization.
How Yuno's Smart Routing Makes This Testable Without Engineering
Running twelve configurations in parallel requires infrastructure that most payment stacks cannot support without significant engineering work. Yuno's smart routing engine lets payment teams configure split tests, adjust traffic percentages, and set routing conditions through a no-code interface. There is no engineering dependency to create or update a routing rule.
The routing engine selects the optimal payment path based on real-time data and historical performance, prioritizing approval rate, cost, or latency according to the merchant's defined objective. Condition-based routing supports any attribute: BIN, country, card brand, currency, payment method, transaction size, or custom logic. Split routing for A/B tests is built in, with automatic retries for failed payments handled without manual intervention.
inDrive used this capability to reach 90% payment approval rates across 50+ countries. Reserva achieved a four-point increase in approval rates in under three months. Livelo recovered 50% of previously failed transactions. None of these results came from a single configuration change. They came from a systematic optimization program built on controlled tests.
How to Prioritize Which Tests to Run First
Not every merchant needs to run all twelve configurations. Start where the data points to the largest gap.
If approval rates vary significantly by market, BIN-level routing and currency matching are the highest-priority tests. If you have a high volume of recurring transactions, tokenization testing should come early. If failed payment recovery is the primary objective, retry logic with a secondary provider delivers the fastest measurable result.
A practical starting point: audit your approval rates by card brand and issuer country for the last 90 days. Identify the three markets or card types with the largest gap between your current rate and the 90% benchmark that merchants with optimized routing achieve. Those gaps are your test backlog.
The Practical Takeaway for Payment Leaders
Testing payment flows in production is not optional at scale. Merchants processing significant volume across multiple markets are making routing decisions constantly, either deliberately through tested configurations or implicitly through static rules that no one has validated in months.
The configurations that moved the needle most consistently were BIN-level routing, soft-decline retry logic, and tokenization for recurring transactions. Together, these three account for the majority of recoverable approval rate improvement available through routing optimization.
Start with one test, one variable, and a fourteen-day window. Move traffic to the winner. Then run the next test. Payment optimization compounds: each percentage point recovered funds the next round of infrastructure investment.
Yuno's smart routing gives payment teams the infrastructure to run these tests without engineering bottlenecks, with full visibility into performance across every provider in a single dashboard. The results are real. The process is repeatable. The only requirement is starting.



.png)

%20(1)%20(1).png)