Reconstructing Random Pictures Bhargav Narayanan and Corrine Yap Abstract. Given a random binary picture Pnof size n i.e. an nngrid filled

2025-04-29 0 0 439.64KB 23 页 10玖币

侵权投诉

Reconstructing Random Pictures

Bhargav Narayanan and Corrine Yap

Abstract.

Given a random binary picture

of size

, i.e., an

n×n

grid ﬁlled

with zeros and ones uniformly at random, when is it possible to reconstruct

from

its

-deck, i.e., the multiset of all its

k×k

subgrids? We demonstrate “two-point

concentration” for the reconstruction threshold by showing that there is an integer

(

)

∼

log n

)

1/2

such that if

k > kc

, then

is reconstructible from its

-deck

with high probability, and if

k < kc

, then with high probability, it is impossible

to reconstruct

from its

-deck. The proof of this result uses a combination of

interface-exploration arguments and entropic arguments.

1. Introduction

Reconstruction problems, at a very high level, ask the following general question:

is it possible to uniquely reconstruct a discrete structure from the “deck” of all its

substructures of some ﬁxed size? The study of such problems dates back to the graph

reconstruction conjectures of Kelly and Ulam [

], and analogous questions for

various other families of discrete structures have since been studied; see [

] for

examples concerning other objects such as hypergraphs, abelian groups, and subsets of

Euclidean space. The line of inquiry that we pursue here concerns reconstructing typical,

as opposed to arbitrary, structures in a family of discrete structures. Such questions,

best phrased in the language of probabilistic combinatorics, often have substantially

diﬀerent answers compared to their extremal counterparts; see [4,22], for instance.

Our aim in this paper is to investigate a two-dimensional reconstruction problem,

namely that of reconstructing random pictures. Before we describe the precise question

we study, let us motivate the problem at hand. Perhaps the most basic one-dimensional

reconstruction problem concerns reconstructing a random binary string from the multiset

of its substrings of some ﬁxed size, a problem intimately connected to that of shotgun-

sequencing DNA sequences; on account of its wide applicability, this question has

been investigated in great detail, as in [

] for instance. A natural analogue of the

aforementioned one-dimensional problem concerns reconstructing a random binary grid

(or picture, for short) from the multiset of its subgrids of some ﬁxed size; this is the

question that will be our focus here. While shotgun-reconstruction of random strings is

2020 Mathematics Subject Classiﬁcation. Primary 60C05; Secondary 60K35, 68R15.

Key words and phrases. reconstruction, random reconstruction, shotgun assembly.

arXiv:2210.09410v3 [math.CO] 28 Jan 2025

Figure 1. A picture of size 3 and its 2-deck.

a well-studied problem, there has been renewed interest — originating from the work of

Mossel and Ross [

] — in generalizations of this problem like the one considered here;

see [8,10,14,21] for some recent examples.

Writing [

] for the set

{

, . . . , n}

, a picture of size

is an element of

{

}[n]2

viewed as a two-coloring of an

n×n

grid using the colors 0 and 1. The

-deck of a

picture

of size

, denoted

(

), is the multiset of its

k×k

colored subgrids of which

there are precisely (

n−k

+ 1)

; see Figure 1for an illustration. We say that a picture

is reconstructible from its

-deck if

(

P′

) =

(

) implies that

P′

. Writing

for a random picture of size

chosen uniformly from the set of all pictures of size

, our primary concern is then the following question raised by Mossel and Ross [

when is

reconstructible from its

-deck with high probability? More speciﬁcally, is

there a threshold value for

such that in the supercritical regime,

is reconstructible

with high probability and in the subcritical regime

is not reconstructible with high

probability? It was previously determined by Mossel and Ross [

] (under the dual

perspective of coloring lattice vertices rather than faces) that the threshold lies between

p(1 −ϵ) log n

and

p(1 + ϵ)4 log n

, and Ding and Liu [

] subsequently narrowed down

the location of the threshold to (

p(1 −ϵ)2 log n, p(1 + ϵ)2 log n

). We note that these

results also hold in higher dimensions. We shall give a nearly complete answer to this

question. Let

(

) be the nearest integer value to

p2 log2n

. Writing

(

n, k

) for the

event that the random picture

is reconstructible from its

-deck, our main result is

as follows.

Theorem 1.1. As n→ ∞,we have

P(R(n, k)) →(0if k < kc(n),and

1if k > kc(n)

In other words, Theorem 1.1 shows that the “reconstruction threshold” of a random

picture is concentrated on at most two consecutive integers. The two results contained in

the statement of Theorem 1.1 are proved by rather disparate methods: the “0-statement”

follows from entropic considerations, while the “1-statement” is proved by an interface-

exploration argument, a technique more commonly seen in percolation theory. The

deﬁnition of

(

) ensures that

n2/

k2→ ∞

k < kc

(

) while

kn2/

k2−k→

0 if

k > kc(n), which is the main way we will use the deﬁnition of kc(n) in our arguments.

Let us brieﬂy mention that a related, but somewhat diﬀerent, two-dimensional

question about reconstructing “jigsaws” was raised by Mossel and Ross [

]. Following

work of Bordenave, Feige, and Mossel [

] and Nenadov, Pﬁster, and Steger [

], the

coarse asymptotics of the reconstruction threshold in this setting were independently

established by Balister, Bollob´as, and the ﬁrst author [

] and by Martinsson [

]. In

contrast, our main result establishes ﬁne-grained asymptotics of the reconstruction

threshold in the (somewhat diﬀerent) setting considered here.

This paper is organized as follows. We give the short proof of the 0-statement

in Theorem 1.1 in Section 2. The bulk of the work in this paper is in the proof of

the 1-statement in Theorem 1.1 which follows in Section 3. We conclude with some

discussion of future directions in Section 4.

Note added in proof: Subsequent to our paper’s appearance on the arXiv, Demidovich,

Panichkin, and Zhukovskii [

] extended our results to

colors and

dimensions for

r, d ≥3. While it is straightforward to verify that our computations allow for a larger

number of colors, we posited in a previous version of this paper that it would be

necessary to ﬁnd an appropriate higher-dimensional generalization of the interface

paths in our arguments in order to extend beyond

= 2. However, the methods of [

]

circumvent this need by careful modiﬁcation to our reconstruction algorithm.

2. Proof of the 0-statement

In this short section, we prove the 0-statement in Theorem 1.1 which, as mentioned

earlier, follows from considerations of entropy.

Proof of the 0-statement in Theorem 1.1.

An easy calculation shows that the deﬁnition

of kc(n) ensures that

n2/2k2→ ∞ (1)

n→ ∞

for every

k < kc

(

). Under this condition, we show that with high probability,

it impossible to reconstruct a random picture

of size

from its

-deck. The reason

is simple: under this assumption on

, the

-deck does not contain enough entropy

to allow reconstruction; for simplicity, we phrase this argument in the language of

counting.

First, the number of pictures of size

is 2

. Next, the number of such pictures that

are reconstructible from their

-decks is at most the number of distinct

-decks, which

is itself at most the number of solutions to the equation

x1+x2+· · · +x2k2= (n−k+ 1)2

over the non-negative integers, where

is the number of copies of the

k×k

grid in

D. It follows that

P[R(n, k)] ≤(n−k+ 1)2+ 2k2−1

2k2−12−n2≤10n2

2k22k2

2−n2.

It is now straightforward to check that

[

(

n, k

)]

→

0 by

(1)

. This proves the 0-

statement in Theorem 1.1.□

3. Proof of the 1-statement

A simple calculation shows that the deﬁnition of kc(n) ensures that

kn2/2k2−k→0 (2)

n→ ∞

for all

k > kc

(

). We shall show that if

(2)

holds, then

is reconstructible

from its

-deck with high probability. To accomplish this, we provide an algorithm for

reconstruction and bound the probability that

is not the output when this algorithm

is run on Dk(Pn).

To motivate our preliminary analysis, we ﬁrst give a rough description of the algorithm.

It begins by randomizing the deck and attempting to build the picture outward from

the ﬁrst element, in one direction at a time. It will begin by extending the starting

element to a small rectangle “naively,” which means it will simply place the ﬁrst deck

element that ﬁts. Then it will extend the rectangle by one column at a time, followed

by one row at a time. The starting rectangle size will be linear in

, small enough

that a union bound suﬃces to show the rectangle is likely a correct reconstruction.

However, continuing to place deck elements naively will not suﬃce to give a whp correct

reconstruction. Thus, in the row and column extension steps we select deck elements to

place more carefully, in a way that has a lower probability of failure.

In the next section, we provide some notation including precise deﬁnitions of “exten-

sion.” We then consider the particular types of subpictures and extensions that will

come into play in our algorithm and show that with high probability, the deck does not

contain elements that will extend “badly.” At last, we provide our algorithm and put

together all of the results to show that the probability the algorithm fails tends to 0

above the threshold.

3.1. Notation and Terminology. We ﬁrst set up some notation and conventions to

use throughout this section. As before, we will let

be the random picture we

wish to reconstruct from its

-deck

(

). A partial picture is a (not necessarily

rectangular) contiguous colored subset of an

n×n

picture. A grid or subgrid is a

rectangular partial picture, and a k-grid is a k×kgrid.

As in matrix notation, the coordinate (

i, j

) will denote the cell in the

th row from

the top,

th column from the left. For a partial picture

, the notation

(

i, j

) will

Figure 2. On the left is a grid

, in the center is a grid

, and on the

right is the extension of Sto the right by T.

denote the entry of cell (

i, j

), either 0 or 1. In our proof we may assume the elements

are distinct; indeed, we will show that the probability a given

-grid appears more

than once in

) in the supercritical regime. Thus, for a

-grid

, we may use

the notation

[

] to mean the cells of the

k×k

subgrid of

whose entries are identical

, as this is well-deﬁned, and similarly

[

(

i, j

)] to mean the entry of

in the

row and jth column of the k-grid P[T].

(More formally, we can introduce an injective map

D→P

mapping each deck

element

to the

k×k

subgrid of

identical to

; using this notion, then,

[

] refers

(

). To lighten the notational burden, we will not use this notion in the remaining

discussion but mention it here for the reader’s beneﬁt.)

We ﬁrst need the notion of “extending” a partial picture.

Deﬁnition 3.1. Given two

k×k

grids

with columns

s1, . . . , sk∈ {

and

with

columns

t1, . . . , tk∈ {

,we say that

extends

to the right if

si+1

for all

≤i≤k−

1. We call

itself an extension of

to the right and the resulting

k×

(

+1)

grid with columns s1, . . . , sk, tkthe extended grid.

See Figure 2for an illustration. We deﬁne extensions of a

-grid to the left, upwards,

and downwards analogously. We may also generalize this deﬁnition to larger partial

pictures, both rectangular and non-rectangular.

Deﬁnition 3.2. Given a partial picture

and a

k×k

subgrid

,we say that a

k×k

grid

extends

to the right at

if it is an extension of

by the above deﬁnition

and if the extension is consistent with all other entries of Q\S.

In other words,

extends

to the right at

if it is possible to place

as an

extension of Swithout creating any conﬂicts.

In the next subsection, we give some probability bounds that we will use in our

argument. The events that we deﬁne will play a role in the analysis of our reconstruction

algorithm described in Section 3.6, but we discuss the events here to make it clear that

they do not depend on the algorithm itself.

文档加载中……请稍候！
如果长时间未打开，您也可以点击刷新试试。

下载文档到电脑，查找使用更方便

10 玖币 0人已下载

立即下载

摘要：

ReconstructingRandomPicturesBhargavNarayananandCorrineYapAbstract.GivenarandombinarypicturePnofsizen,i.e.,ann×ngridfilledwithzerosandonesuniformlyatrandom,whenisitpossibletoreconstructPnfromitsk-deck,i.e.,themultisetofallitsk×ksubgrids?WedemonstrateΨwo-pointconcentration”forthereconstructionthreshol...

展开>> 收起<<

Reconstructing Random Pictures Bhargav Narayanan and Corrine Yap Abstract. Given a random binary picture Pnof size n i.e. an nngrid filled.pdf

共23页,预览5页

还剩页未读，继续阅读

声明：本站为文档C2C交易模式，即用户上传的文档直接被用户下载，本站只是中间服务平台，本站所有文档下载所得的收益归上传人(含作者)所有。玖贝云文库仅提供信息存储空间，仅对用户上传内容的表现方式做保护处理，对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私，请立即通知玖贝云文库，我们立即给予删除！

Reconstructing Random Pictures Bhargav Narayanan and Corrine Yap Abstract. Given a random binary picture Pnof size n i.e. an nngrid filled

相关推荐

开通VIP享超值会员特权

作者详情

相关内容

热门标签

举报选择: