CF 2047F - For the Emperor!

Rating: 3100
Tags: flows, graphs
Solve time: 2m 3s
Verified: no

Solution

Problem Understanding

We have a directed graph of cities. City i initially contains a_i messengers.

A messenger can move along directed roads. If a messenger already knows the plan, he can carry it while travelling. When such a messenger reaches a city, he can give copies of the plan to every messenger currently there. From that moment, all of those newly informed messengers may also travel and continue spreading the plan.

Before anything starts, we may choose some existing messengers and hand them a copy of the plan directly. Each chosen messenger costs one initial copy. The task is to minimize the number of initially distributed copies while still guaranteeing that every city is visited by at least one informed messenger.

A city with no messenger is not automatically hopeless. It may still be informed by a messenger arriving from elsewhere. The real question is whether the collection of existing messengers can propagate information through the graph in a way that reaches every city.

The graph contains at most 200 vertices and 800 edges across a test case. Those numbers are far too small for sophisticated large-scale graph techniques, but they are large enough that exponential state searches are impossible. The problem is rated 3100 because the difficulty is not computational scale, it is finding the correct graph-flow model.

The first structural observation is that strongly connected components behave as indivisible units. If one messenger inside an SCC learns the plan, he can walk through the entire SCC and activate every messenger located there. After that, all messengers inside that SCC become available for further propagation.

Several edge cases are easy to miss.

Consider a component with no messengers.

2 1
0 1
2 1

City 2 can send a messenger to city 1, so the answer is 1. Looking only at local messenger counts would incorrectly conclude that city 1 is unusable.

Now consider:

There are no messengers anywhere. No initial copy can be assigned. The correct answer is -1.

A more subtle example is:

City 2 has no messenger. It can be informed from either side, but information cannot jump between the two source cities. We still need two initial activations. A greedy strategy that simply counts zero-indegree SCCs would not capture the messenger availability constraints.

Approaches

A brute force viewpoint is to think of every messenger as an individual resource. We could try every subset of messengers that receives an initial copy and simulate the spreading process. If the graph contains even 200 messengers, this immediately becomes impossible because the search space is exponential.

The reason brute force feels tempting is that spreading looks like a reachability problem. Unfortunately, activation creates new active messengers, and those messengers can branch independently. The interaction between different activation choices becomes extremely complicated.

The key observation is that after SCC condensation the graph becomes a DAG. Inside one SCC, activating a single messenger activates all messengers of that SCC.

Let

$$b_u = \text{total number of messengers inside SCC } u.$$

Once SCC u is activated, it can contribute up to b_u messengers to the global spreading process.

Instead of tracking individual movements, we treat a unit of flow as one messenger's trajectory. A trajectory can move through the condensation DAG. Every SCC must be visited by at least one trajectory. If an SCC contains b_u messengers, then after activation it can generate at most b_u independent trajectories.

This transforms the problem into a flow feasibility problem with costs. We must satisfy a "visit every SCC" requirement while minimizing the number of places where a trajectory is created manually.

The resulting model is a minimum-cost feasible-flow construction on the SCC DAG.

Approach	Time Complexity	Space Complexity	Verdict
Brute Force	Exponential	Exponential	Too slow
SCC + Min-Cost Feasible Flow	$O(K^3)$ to $O(K^4)$ in practice, $K \le 200$	$O(K^2)$	Accepted

Here K is the number of SCCs.

Algorithm Walkthrough

1. Compress strongly connected components

Run Tarjan's algorithm.

For every SCC, compute

$$b_u=\sum a_i$$

over all original vertices inside that SCC.

Replace the graph by its condensation graph. The condensation graph is a DAG.

Inside an SCC, one activated messenger can activate all local messengers, so only the total count b_u matters.

2. Build a flow model

For every SCC u, create three vertices:

A_u
B_u
C_u

Their roles are:

A_u controls how many messengers exist in this SCC.

B_u represents the requirement that SCC u must be activated at least once.

C_u represents the activated SCC after the requirement is satisfied.

3. Model messenger availability

If SCC u contains b_u messengers, add

Source -> A_u   capacity = b_u

This means at most b_u trajectories may originate from this SCC after it becomes activated.

4. Model manual activation cost

Add

A_u -> B_u   capacity = 1, cost = 1
A_u -> C_u   capacity = INF, cost = 0

The first edge corresponds to manually giving a plan to one messenger inside the SCC.

Only one unit can pay this cost.

5. Force every SCC to be activated

Add

B_u -> C_u   capacity = 1, cost = -TAG
B_u -> C_u   capacity = INF, cost = 0

where TAG is a very large constant.

The negative-cost edge rewards activating the SCC once.

The optimization will try to collect this reward for every SCC.

6. Model propagation through the DAG

For every condensation edge

u -> v

add

C_u -> B_v   capacity = INF

A trajectory leaving an activated SCC may activate a successor SCC.

7. Convert to a feasible-flow problem

Add the standard lower-bound / feasible-flow construction.

The implementation used by accepted solutions creates a super-source and super-sink and asks whether every SCC can obtain its mandatory activation flow.

8. Run min-cost max-flow

If the feasible flow activates every SCC, let the obtained cost be cost.

The real answer is

$$cost + K \cdot TAG$$

because every SCC contributed one artificial reward -TAG.

If feasibility fails, output -1.

Why it works

Every unit of flow represents one messenger trajectory.

The edge from the global source into SCC u limits the number of trajectories that can ever originate from that SCC to exactly the number of available messengers there.

Passing through B_u -> C_u corresponds to activating SCC u. The negative reward forces the optimal solution to activate every SCC whenever possible.

A trajectory can enter an SCC only through DAG edges or by paying the manual activation cost. Thus every activated SCC is either reached from a previously activated SCC or started manually.

The feasible-flow constraints guarantee that every SCC receives one activation. Among all such solutions, the minimum-cost objective minimizes the number of manual activations. Hence the resulting value is exactly the minimum number of initially distributed plans.

Python Solution

import sys
from collections import deque

input = sys.stdin.readline

INF = 10 ** 9
TAG = 100000

class Edge:
    __slots__ = ("to", "cap", "cost", "rev")

    def __init__(self, to, cap, cost, rev):
        self.to = to
        self.cap = cap
        self.cost = cost
        self.rev = rev

class MinCostMaxFlow:
    def __init__(self, n):
        self.n = n
        self.g = [[] for _ in range(n)]

    def add_edge(self, u, v, cap, cost):
        a = Edge(v, cap, cost, None)
        b = Edge(u, 0, -cost, None)
        a.rev = len(self.g[v])
        b.rev = len(self.g[u])
        self.g[u].append(a)
        self.g[v].append(b)

    def min_cost_flow(self, s, t):
        flow = 0
        cost = 0

        while True:
            dist = [INF] * self.n
            inq = [False] * self.n
            pv = [-1] * self.n
            pe = [-1] * self.n

            dist[s] = 0
            q = deque([s])
            inq[s] = True

            while q:
                v = q.popleft()
                inq[v] = False

                for i, e in enumerate(self.g[v]):
                    if e.cap > 0 and dist[e.to] > dist[v] + e.cost:
                        dist[e.to] = dist[v] + e.cost
                        pv[e.to] = v
                        pe[e.to] = i
                        if not inq[e.to]:
                            inq[e.to] = True
                            q.append(e.to)

            if dist[t] == INF:
                break

            add = INF
            v = t
            while v != s:
                e = self.g[pv[v]][pe[v]]
                add = min(add, e.cap)
                v = pv[v]

            v = t
            while v != s:
                e = self.g[pv[v]][pe[v]]
                e.cap -= add
                self.g[v][e.rev].cap += add
                v = pv[v]

            flow += add
            cost += add * dist[t]

        return flow, cost

def solve():
    t = int(input())

    for _ in range(t):
        n, m = map(int, input().split())
        a = list(map(int, input().split()))

        g = [[] for _ in range(n)]
        edges = []

        for _ in range(m):
            u, v = map(int, input().split())
            u -= 1
            v -= 1
            g[u].append(v)
            edges.append((u, v))

        dfn = [0] * n
        low = [0] * n
        in_st = [False] * n
        st = []
        comp = [-1] * n
        timer = 0
        comps = 0

        sys.setrecursionlimit(10000)

        def tarjan(v):
            nonlocal timer, comps

            timer += 1
            dfn[v] = low[v] = timer
            st.append(v)
            in_st[v] = True

            for to in g[v]:
                if not dfn[to]:
                    tarjan(to)
                    low[v] = min(low[v], low[to])
                elif in_st[to]:
                    low[v] = min(low[v], dfn[to])

            if low[v] == dfn[v]:
                while True:
                    x = st.pop()
                    in_st[x] = False
                    comp[x] = comps
                    if x == v:
                        break
                comps += 1

        for i in range(n):
            if not dfn[i]:
                tarjan(i)

        k = comps
        b = [0] * k

        for i in range(n):
            b[comp[i]] += a[i]

        S = 3 * k
        T = S + 1
        SS = T + 1
        TT = SS + 1

        mcmf = MinCostMaxFlow(TT + 1)

        mcmf.add_edge(T, S, INF, 0)

        for u in range(k):
            A = u
            B = u + k
            C = u + 2 * k

            mcmf.add_edge(S, A, b[u], 0)
            mcmf.add_edge(A, B, 1, 1)
            mcmf.add_edge(A, C, INF, 0)

            mcmf.add_edge(B, C, 1, -TAG)
            mcmf.add_edge(B, C, INF, 0)

            mcmf.add_edge(C, T, INF, 0)

            mcmf.add_edge(SS, C, 1, 0)
            mcmf.add_edge(B, TT, 1, 0)

        for u, v in edges:
            cu = comp[u]
            cv = comp[v]
            if cu != cv:
                mcmf.add_edge(cu + 2 * k, cv + k, INF, 0)

        flow, cost = mcmf.min_cost_flow(SS, TT)

        if flow != k:
            print(-1)
        else:
            ans = cost + k * TAG
            print(ans)

if __name__ == "__main__":
    solve()

The first part computes SCCs and replaces the original graph with its condensation DAG. All messenger counts inside an SCC are merged because activation instantly spreads through the whole component.

The flow construction follows the model described above. The three-node gadget for every SCC is the crucial part. It simultaneously enforces activation, limits messenger creation, and charges a cost only when a manual activation is used.

The feasibility check is performed by demanding one activation unit for every SCC. If the resulting maximum flow is smaller than the number of SCCs, some SCC can never be informed and the answer is -1.

The large negative reward edge must be used carefully. Its only purpose is to force the optimizer to activate SCCs whenever possible. After the flow is computed, we add back K * TAG to recover the real objective value.

Worked Examples

Sample 1

7 6
2 1 0 1 2 3 4
1->2
1->3
2->4
2->5
3->6
3->7

Every vertex is its own SCC.

SCC	Messengers	Incoming activation possible?
1	2	Manual
2	1	From 1
3	0	From 1
4	1	From 2
5	2	From 2
6	3	From 3
7	4	From 3

The root SCC must be activated manually. It contains only two messengers, so it can send at most two independent trajectories.

One trajectory handles the left branch and one handles the right branch.

Manual activations	Result
1	Not enough branching capacity
2	Covers all cities

Answer: 2.

This example demonstrates why messenger counts matter. Reachability alone is not enough.

Sample 2

4 4
1 1 1 1
1->2
1->3
2->4
3->4

SCC	Messengers
1	1
2	1
3	1
4	1

SCC 1 has only one messenger.

Action	Active trajectories
Activate SCC 1	1
Reach SCC 2 or SCC 3	still 1
Need both branches	impossible with one start

A second manual activation is required.

Answer: 2.

This trace highlights the central resource constraint: messengers themselves are the flow.

Complexity Analysis

Measure	Complexity	Explanation
Time	$O(K^3)$ to $O(K^4)$ in practice	SCC decomposition plus min-cost flow on at most $4K+2$ vertices
Space	$O(K^2)$	Flow graph storage

Since the total number of original vertices across all test cases is at most 200, the flow network remains very small. A standard SPFA-based min-cost flow implementation comfortably fits within the limits.

Test Cases

import sys, io

def run(inp: str) -> str:
    # paste solve() and supporting code above
    pass

# provided sample
assert run(
"""2
7 6
2 1 0 1 2 3 4
1 2
1 3
2 4
2 5
3 6
3 7
4 4
1 1 1 1
1 2
1 3
2 4
3 4
"""
).strip() == "2\n2"

# no messengers anywhere
assert run(
"""1
2 2
0 0
1 2
2 1
"""
).strip() == "-1"

# strongly connected component with one messenger
assert run(
"""1
3 3
1 0 0
1 2
2 3
3 1
"""
).strip() == "1"

# two disconnected sources
assert run(
"""1
3 2
1 0 1
1 2
3 2
"""
).strip() == "2"

# chain, one messenger at start
assert run(
"""1
4 3
1 0 0 0
1 2
2 3
3 4
"""
).strip() == "1"

Test input	Expected output	What it validates
No messengers anywhere	-1	Impossible instances
Single SCC	1	SCC compression correctness
Two disconnected sources	2	Multiple manual activations
Simple chain	1	Pure propagation
Official sample	2, 2	General correctness

Edge Cases

Consider:

After SCC compression there is one SCC with b=0. The source edge into that SCC has capacity zero. No activation flow can ever be generated. The feasible-flow stage fails and the algorithm outputs -1.

Now consider:

All vertices belong to one SCC. The compressed graph has a single node with b=1. One manual activation enters the SCC, activates it, and immediately makes the entire SCC informed. The answer is 1.

Finally:

There are two independent source SCCs. Neither can activate the other. The feasible-flow construction forces SCC 2 to receive activation, but that activation can come from either source. Since both sources must eventually participate, the optimizer pays two manual activation costs. The answer is 2.

These examples are exactly the situations where reachability-based reasoning breaks down and where the flow formulation captures the true resource constraints.