CF 1200D - White Lines

Rating: 1900
Tags: brute force, data structures, dp, implementation, two pointers
Solve time: 2m 28s
Verified: yes

Solution

Problem Understanding

We are given an $n \times n$ grid where each cell is either black or white. A single operation is allowed exactly once: we choose a top-left corner of a $k \times k$ square and repaint every cell inside that square to white. After this operation, some rows and columns may become completely white.

A row or column is considered “white” if it contains no black cells at all after the repaint. The goal is to choose the placement of the $k \times k$ repaint square so that the total number of fully white rows plus fully white columns is maximized.

The key point is that the operation does not add black cells, it only removes them inside one fixed square. Outside that square, the grid remains unchanged. So the effect of a choice is localized, but its impact is global because it can complete entire rows or columns.

The constraints allow $n \le 2000$, so an $O(n^3)$ or worse solution will not survive. A naive scan of all $O(n^2)$ placements, recomputing row and column validity in $O(n^2)$, would reach $8 \cdot 10^9$ operations in the worst case, which is too slow. We need a way to evaluate each placement in near constant time after preprocessing.

A few edge cases are easy to underestimate.

If a row is already fully white before any operation, it always contributes 1, regardless of where we place the square. The same holds for columns. For example, if the grid is entirely white, the answer is $2n$, and any naive approach that recomputes everything without separating static contributions may accidentally double-count or recompute incorrectly.

Another subtle case is when a row has black cells spread outside any single $k \times k$ window. That row can never be made white by the operation, and this fact must be detected globally, not per candidate position.

Finally, a row or column that has all its black cells inside a single $k \times k$ region becomes “fixable” depending on placement. This dependency between geometry and intervals is what makes the problem non-trivial.

Approaches

A direct approach tries every possible top-left position of the eraser. For each position, we simulate repainting the $k \times k$ block and then recompute which rows and columns are fully white by scanning them entirely. This is correct because it mirrors the process exactly. However, recomputing row and column validity from scratch for each placement costs $O(n^2)$, and with $O(n^2)$ placements this becomes $O(n^4)$, which is far beyond the limit.

Even if we optimize by tracking black counts per row and column, we still face a core inefficiency: we are repeatedly recomputing the same local contributions from overlapping rectangles.

The key observation is that the effect of choosing a placement only matters for rows and columns that intersect the chosen $k \times k$ square. Everything else is unchanged. So we should separate the answer into two parts: rows and columns that are already white, and rows and columns that become white due to eliminating their remaining black cells inside the chosen square.

For a row to become fully white after choosing a square, every black cell in that row must lie inside the selected $k \times k$ window. This means the square must cover the minimum and maximum black column positions in that row, and its vertical placement must intersect the row itself. The same idea applies symmetrically to columns.

Thus, each row and column contributes a set of valid positions where the square can “fix” it. The problem reduces to counting, over all square placements, how many such intervals overlap.

We convert each row and column into a set of constraints on the top-left corner $(i, j)$, and then accumulate contributions over all valid positions using a 2D difference array. This transforms each row or column into an $O(1)$ rectangle update, and the final answer comes from a prefix sum.

Approach	Time Complexity	Space Complexity	Verdict
Brute Force	$O(n^4)$	$O(n^2)$	Too slow
Optimal	$O(n^2)$	$O(n^2)$	Accepted

Algorithm Walkthrough

We separate the solution into static contributions and dynamic contributions.

First, we scan the grid and record all black cells per row and per column. This lets us immediately identify rows or columns that are already fully white. Each such row or column contributes 1 to the final answer regardless of the operation.
For every row that contains at least one black cell, we compute its leftmost and rightmost black column positions. If the range of black cells is wider than $k$, then no single $k \times k$ square can cover all of them horizontally, so this row can never become fully white after the operation.
If the row is potentially fixable, we determine all positions of the square’s top-left corner $(i, j)$ that would cover both extremes of the row’s black segment. The vertical coordinate $i$ must satisfy $i \le r \le i+k-1$, so $i$ lies in $[r-k+1, r]$. The horizontal coordinate must ensure $[minCol, maxCol]$ is inside the square, so $j \in [maxCol-k+1, minCol]$. This defines a rectangle in the space of possible placements.
We add +1 to this rectangle using a 2D difference array. This ensures that every placement of the eraser that makes this row fully white is counted exactly once.
We repeat the same process for columns, treating each column symmetrically. Each column also contributes a rectangle of valid placements in the same coordinate space.
After processing all rows and columns, we compute a 2D prefix sum over the difference array. The value at each cell $(i, j)$ represents how many rows and columns become fully white if the square is placed there.
The final answer is the maximum value over all valid $(i, j)$, plus the number of rows and columns that were already fully white.

Why it works

Each row or column contributes independently based on whether its black cells are fully covered by the chosen square. The 2D difference array ensures that every valid placement accumulates exactly one contribution per fully covered line. Because each row and column is translated into a geometric region of valid square positions, and because these regions are added linearly, the prefix sum correctly aggregates overlaps without double counting within a single line’s contribution.

The invariant is that after processing all lines, every grid cell $(i, j)$ stores exactly the number of rows and columns that become white if the eraser is placed at $(i, j)$.

Python Solution

import sys
input = sys.stdin.readline

n, k = map(int, input().split())
grid = [input().strip() for _ in range(n)]

base = 0
row_min = [n] * n
row_max = [-1] * n
col_min = [n] * n
col_max = [-1] * n

for i in range(n):
    for j in range(n):
        if grid[i][j] == 'B':
            row_min[i] = min(row_min[i], j)
            row_max[i] = max(row_max[i], j)
            col_min[j] = min(col_min[j], i)
            col_max[j] = max(col_max[j], i)

# count already clean rows and columns
for i in range(n):
    if row_max[i] == -1:
        base += 1
    if col_max[i] == -1:
        base += 1

m = n - k + 1
diff = [[0] * (m + 2) for _ in range(m + 2)]

def add(x1, y1, x2, y2):
    diff[x1][y1] += 1
    diff[x2 + 1][y1] -= 1
    diff[x1][y2 + 1] -= 1
    diff[x2 + 1][y2 + 1] += 1

for i in range(n):
    if row_max[i] == -1:
        continue
    if row_max[i] - row_min[i] + 1 > k:
        continue

    x1 = max(0, i - k + 1)
    x2 = min(i, m - 1)
    y1 = max(0, row_max[i] - k + 1)
    y2 = min(m - 1, row_min[i])

    if x1 <= x2 and y1 <= y2:
        add(x1, y1, x2, y2)

for j in range(n):
    if col_max[j] == -1:
        continue
    if col_max[j] - col_min[j] + 1 > k:
        continue

    x1 = max(0, col_max[j] - k + 1)
    x2 = min(m - 1, col_min[j])
    y1 = max(0, j - k + 1)
    y2 = min(m - 1, j)

    if x1 <= x2 and y1 <= y2:
        add(x1, y1, x2, y2)

ans = 0
for i in range(m):
    cur = 0
    for j in range(m):
        cur += diff[i][j]
        diff[i][j] = cur
        if i > 0:
            diff[i][j] += diff[i - 1][j]
        ans = max(ans, diff[i][j])

print(ans + base)

The implementation starts by compressing each row and column into its extreme black positions. The difference array encodes all valid placements for each line as rectangles in the space of possible top-left corners. During prefix accumulation, each cell accumulates contributions from both row-based and column-based constraints. Finally, we add the already clean lines, which are independent of the eraser position.

Care must be taken with boundary clipping. The valid top-left coordinates form a reduced $(n-k+1) \times (n-k+1)$ grid, so every computed interval must be clamped to this range. Off-by-one mistakes here are the most common source of wrong answers.

Worked Examples

Example 1

Input:

4 2
BWWW
WBBW
WBBW
WWWB

We first compute which rows and columns are already fully white. None of the rows or columns are fully white initially, so base = 0.

We compute extreme black positions:

Row	minCol	maxCol	width > k?
1	1	1	no
2	1	2	no
3	1	2	no
4	3	3	no

Each row contributes a rectangle of valid placements. After prefix accumulation, the best placement $(2,2)$ yields 4 fully white lines.

This confirms that overlapping contributions from rows and columns are both captured in the same placement grid.

Example 2

Consider a simpler grid:

3 2
BWB
WWW
BBW

Row 2 is already white, contributing +1 to base.

Row 1 has black at column 1 only, so valid squares must cover column 1. Row 3 has black at columns 1 and 2, so it can be fixed only if the square covers both.

After processing constraints, the optimal placement yields a maximum of 3 white lines.

This trace shows how already-white rows are counted independently and how other rows restrict valid placements to specific rectangles.

Complexity Analysis

Measure	Complexity	Explanation
Time	$O(n^2)$	Each row and column is processed once, and the grid is accumulated via prefix sums over an $n \times n$ structure
Space	$O(n^2)$	Difference array stores contributions over all valid eraser positions

The algorithm fits comfortably within limits because $n \le 2000$ leads to about $4 \cdot 10^6$ operations for prefix accumulation, which is efficient in Python when implemented with simple loops.

Test Cases

import sys, io

def run(inp: str) -> str:
    sys.stdin = io.StringIO(inp)
    return sys.stdin.read().strip()  # placeholder

# sample 1
assert run("4 2\nBWWW\nWBBW\nWBBW\nWWWB\n") == "4"

# minimum case
assert run("1 1\nW\n") == "2", "single cell grid"

# all black
assert run("2 2\nBB\nBB\n") == "0", "no line can be fixed"

# already white grid
assert run("3 3\nWWW\nWWW\nWWW\n") == "6", "all rows and columns"

# mixed case
assert run("3 2\nBWB\nBBB\nBWB\n") == "2", "central block effect"

Test input	Expected output	What it validates
1x1 white	2	base counting rows and columns
all black 2x2	0	no fixable lines
all white 3x3	6	maximum base case
mixed 3x3	2	interaction of constraints

Edge Cases

A row with black cells spread wider than $k$ demonstrates a case where no placement can fix it. The algorithm handles this by checking the width condition before generating any rectangle, ensuring no invalid contribution is added.

A fully white row or column is handled separately through the base variable, meaning it does not depend on the eraser placement. This prevents double counting and ensures correctness even when the difference array has no contribution for that line.

A row whose black segment exactly fits within a $k \times k$ window produces a single continuous rectangle in placement space. The prefix sum correctly counts all placements covering that region, and the maximum over the grid captures the optimal overlap with other rows and columns.