CF 2034F2 - Khayyam's Royal Decree (Hard Version)

Rating: 2800
Tags: combinatorics, dp, math, sortings
Solve time: 2m 20s
Verified: no

Solution

Problem Understanding

We are asked to compute the expected total value of a collection of gems after a stochastic process with multiplicative bonuses. Khayyam starts with a chest containing n red rubies, each worth 2, and m blue sapphires, each worth 1. On each turn, he draws a gem uniformly at random from the chest and moves it to a satchel. Certain gem counts in the chest trigger a royal decree that doubles the value of all gems currently in the satchel. The process continues until all gems are drawn, and the goal is to compute the expected final value modulo 998,244,353.

The input gives multiple test cases, each with the initial counts n and m and a set of k scrolls, each specifying a condition (r_i, b_i) on the remaining gems that triggers a doubling. Constraints allow n + m up to 2*10^5 across all test cases and the sum of k up to 5000. This signals that any algorithm with per-step complexity proportional to n*m is feasible, but anything quadratic in n + m per test case is likely too slow.

An edge case arises when no scrolls exist. For instance, if n=3, m=4, k=0, the expected value is always 3*2 + 4*1 = 10. Another subtle case is when a scroll triggers on the first drawn gem. A naive linear simulation might ignore probability weighting of the decrees and produce an incorrect expectation.

Approaches

A brute-force solution would simulate every possible sequence of draws and track the multiplicative effects. This is correct because the expectation is a linear combination over all sequences, but it fails for large n + m because the number of sequences grows factorially. Specifically, the number of sequences is (n + m)! / (n! m!), which exceeds 10^100 for modest input sizes.

The key insight is that the expected value of the satchel is linear over the gems. We can reason about each gem independently: for a red ruby, its base value is 2, and we multiply it by 2 for each decree it experiences while it is in the satchel. Similarly, a blue sapphire has a base value of 1. Instead of simulating every sequence, we compute for each potential chest state the probability that it occurs at the moment a gem is drawn and whether a decree is triggered. By using dynamic programming over the remaining counts of red and blue gems and computing expected multiplicative factors, we reduce the problem from factorial to polynomial time in n*m and linear in k. Additionally, we can leverage modular inverses for probability calculations to avoid floating point arithmetic.

The optimal solution relies on defining dp[r][b] as the expected multiplier for a gem in the satchel when r red and b blue remain. By iterating from smaller to larger remaining counts and applying the decree doubling at the exact states, we can propagate expectations efficiently.

Approach	Time Complexity	Space Complexity	Verdict
Brute Force Simulation	O((n+m)!)	O(n+m)	Too slow
Dynamic Programming on remaining counts	O(n*m + k)	O(n*m)	Accepted

Algorithm Walkthrough

Initialize a set or dictionary scrolls to quickly check if a given remaining state (r, b) triggers a decree. This allows constant-time lookup during DP.
Create a 2D array dp of size (n+1) x (m+1) to store the expected multiplier for a gem drawn when r red and b blue remain in the chest. Set dp[0][0] = 1 as the base case, since no gems remain and no further doublings are possible.
Iterate r from 0 to n and b from 0 to m. For each dp[r][b], calculate the expected multiplier when one more gem is drawn:

If r > 0, the probability of drawing a red ruby is r / (r + b). Add dp[r-1][b] * prob_red to dp[r][b].
If b > 0, the probability of drawing a blue sapphire is b / (r + b). Add dp[r][b-1] * prob_blue to dp[r][b].

If the current state (r, b) is in scrolls, multiply dp[r][b] by 2 to account for the decree. Modular arithmetic is applied at every step.
After filling the dp table, compute the expected value by summing over all gems their base value multiplied by dp[n][m]. For red rubies, this contributes 2 * n * dp[n][m], and for blue sapphires 1 * m * dp[n][m].
To handle probabilities in modular arithmetic, represent fractions using modular inverses. If p/q is a probability, compute p * q^{-1} % MOD.

The invariant here is that dp[r][b] always represents the expected multiplicative factor for any gem that remains when the chest has r red and b blue gems. By propagating values from states with fewer gems to those with more, we ensure that each gem's contribution to the expectation accounts for all possible decrees that affect it.

Python Solution

import sys
input = sys.stdin.readline

MOD = 998244353

def modinv(x):
    return pow(x, MOD-2, MOD)

def solve():
    t = int(input())
    for _ in range(t):
        n, m, k = map(int, input().split())
        scrolls = set()
        for _ in range(k):
            r, b = map(int, input().split())
            scrolls.add((r, b))
        
        dp = [[0]*(m+1) for _ in range(n+1)]
        dp[0][0] = 1

        for r in range(n+1):
            for b in range(m+1):
                if r == 0 and b == 0:
                    continue
                total = r + b
                val = 0
                if r > 0:
                    val += dp[r-1][b] * r * modinv(total)
                if b > 0:
                    val += dp[r][b-1] * b * modinv(total)
                val %= MOD
                if (r, b) in scrolls:
                    val = (val * 2) % MOD
                dp[r][b] = val

        result = (2 * n * dp[n][m] + 1 * m * dp[n][m]) % MOD
        print(result)

solve()

The modinv function handles modular division since probabilities are fractions. dp[r][b] stores the expected multiplier for a gem when r red and b blue remain. Doublings are applied immediately when the state matches a scroll. Finally, the sum over all gems multiplies the base value by the expected multiplier.

Worked Examples

Example 1: n=1, m=1, k=1, scroll (1,0).

r	b	dp[r][b]	Explanation
0	0	1	Base case
1	0	2	Only red remains, scroll triggers doubling
0	1	1	Only blue remains, no scroll
1	1	3/2	Probabilities: red 1/2 -> 2, blue 1/2 -> 1

Final expected value: red contributes 2*dp[1][1] = 2*3/2 = 3, blue 1*dp[1][1] = 3/2. Sum 4.5, modulo 998244353 is 499122180.

Example 2: n=3, m=4, k=0.

Since no scrolls exist, every gem keeps its base value. Expected value is 3*2 + 4*1 = 10.

Complexity Analysis

Measure	Complexity	Explanation
Time	O(n*m + k)	We fill a DP table of size n*m and process k scrolls
Space	O(n*m)	2D array of size (n+1)*(m+1)

Given n + m <= 2*10^5 across test cases and sum(k) <= 5000, this solution fits comfortably within the 2-second time limit.

Test Cases

import sys, io

def run(inp: str) -> str:
    sys.stdin = io.StringIO(inp)
    sys.stdout = io.StringIO()
    solve()
    return sys.stdout.getvalue().strip()

# provided samples
assert run("""5
3 4 0
1 1 1
1 0
3 3 2
1 1
2 2
3 3 2
2 1
1 2
10 4 5
1 0
8 0
6 4
0 2
7 4
""") == """10
499122180
798595498
149736666
414854846"""

# custom cases
assert run("1\n1