You are not logged in. Please login at www.codechef.com to post your questions!

×

SMARKET - Editorial

0
3

PROBLEM LINK

Practice
Contest

DIFFICULTY

medium

PREREQUISITES

segment tree, sqrt decomposition, offline processing of queries, binary search

PROBLEM

You are given an array $A$ of integers of size $N$. A block is defined as maximal consecutive subarray with equal values. For example, if $A = [20, 10, 10, 7, 7, 7, 10]$, then there are four blocks in the array, i.e. [20], [10, 10], [7, 7, 7] and [10].

You are given $Q$ queries, each consisting of $L, R, K$, you have to find number of blocks in the subarray $[L, R]$ that are of length at least $K$.

Solving first subtask

For solving first subtask, you can find the length blocks in a given subarray in the range $L$ to $R$ in a single iteration over the subarray $[L, R]$. Thus you can also find the count of blocks having length at least $K$ in $\mathcal{O}(|R - L + 1|)$ time, which can be $\mathcal{O}(n)$ in the worst case. So, the worst case complexity of answering all the queries will be $\mathcal{O}(n * Q)$, which will take around $3000^2 = 10^7$ operations, sufficient to run under a second which is good enough for the first subtask.

You can not afford to iterate over all the elements in the range of the query for answering it. You have to either somehow do some preprocessing or build some sort of data structure that can efficiently answer the query in time asymptotically better than $\mathcal{O} (n)$.

A smart solution of $\mathcal{O}(\log n)$ complexity per query

Let us take the example given in the statement, $[20, 10, 10, 7, 7, 7, 10]$. We will create another array $B$ as follows. $B_i$ will denote the number of suffixes of $A[1..i]$ that have equal elements. Intuitively, it denotes the length of the overlap of block consisting $A_i$ with $A[1..i]$. For our above example, the $B$ array will be $[1, 1, 2, 1, 2, 3, 1]$. If we would have to find the number of blocks with length at least $K$ in the entire array, we can notice that this will be precisely equal to number of $B_i$'s equal to $K$. This is true because if there is a block is of length $X$, then we will have written numbers $1, 2, 3, \dots, X$ written as $B$ values at the corresponding indices of block. So, this block will be considered for each of the $K$'s of the query where $K$ is in the range $1$ to $X$. In our example, you can see that there are two blocks of length at least 2 (i.e. the block [10, 10] and the block [7, 7, 7]), which is same as number of 2's in the array $B$. The number of blocks of length 1 are 3, which is also same as number of 1's in the array $B$.

Now, what if our query does not span the whole array? Let the query be generic range $[L, R]$. There can be three type of blocks that we will might have to consider.

- The blocks that lie completely inside.
- The left side block lying partially in range
- The right side block lying partially in the range.

For example, for the arary $[20, \textbf{20, 10, 10, 7, 7}, 7, 10]$ and the range $[2, 6]$, the block [10, 10] lies completely inside the range, and the left side block $[20, \textbf{20}]$ lies partially inside the range. Similarly the right stable block $[\textbf{7, 7}, 7]$ also lies partially inside the range.

For easiness in the explanation, let us assume that there are blocks of type 1 present in the query. Later we will learn how to deal with the other case too.

You can see that for the blocks that are completely inside the range, the number of $B_i$ equal to $K$ will give the correct number of blocks of length $K$ in the range $[L, R]$. What's the case with left and right side partial blocks? The contribution of right side block will also be counted correctly in this. This is because, all the values in the range $1, 2, \dots B_R$ are present in the range $[L, R]$. But for the left block, the $B_i$'s don't accurately represent the $B_i$ values if we would have considered the only the subrray $[L, R]$ in the consideration. In the later case, we should have started the $B$ values from $1$, $2$ onwards. For example, $B_2 = 2$, but that doesn't mean that block $[20]$ should be counted as of length 2. This block is of length 1. So, it could happen that the left side block is of not length at least $K$. We handle this case separately by checking whether the left side block is indeed counted in the answer when we calculate the number of $K$'s in the range $[L, R]$, and whether it should be really counted or not. Identifying which is the case is a matter of easy bookkeeping observations that can be made.

Now, remains the case when there is a no block that lies completely inside the range $[L, R]$. This special case is very easy to deal as one just have to check the lengths of left and right blocks.

For a given fixed array, the queries of finding the number of elements in a range $L, R$ that are equal to $K$ can be done in $\mathcal{O}(log n)$ time after preprocessing of the array. For each number $x$ in the array, you can create a sorted list of indices where $x$ appears. After that finding number of elements in the range $[L, R]$, will require you to do two binary searches to find the smallest index $\geq L$ and largest index $\leq R$.

Thus, each query will have $\mathcal{O}(\log n)$ time complexity. Overall time complexity of the solution will be $\mathcal{O}(Q \log n)$.

Alternate view of the problem.

Let us create another array $B$ that denotes the lengths of the blocks of the array $A$. Number of elements in $B$ will be equal to the number of blocks of the array $A$. For $A = [20, 10, 10, 7, 7, 7, 10]$, the array $B$ will be $[1, 3, 2, 1]$. From the previous discussion, you have seen that finding the answer for the blocks that completely lie inside the range is difficult. Processing the two left and right blocks is just a matter of small bookkeeping. So, we will see how to find the number of blocks of length at least $K$ that lie completely inside the range $[L, R]$.

Using two binary searches, we can identify the range $l, r$ such that $B[l..r]$ denote the length of blocks that lie inside the range $[L, R]$. We have to find number of elements in the range $[l, r]$ that are at least $K$.

So, we have to solve the following problem.

Given an array $a$. You have to queries $[l, r]$, find the number of elements in the range that are greater than or equal to $K$.

Solution using segment tree

The above problem can be solved by many standard data structures. You can create a segment tree whose each node will store the sorted list of the values in the range. Space and time complexity of creating this structure will be $\mathcal{O}(n \log n)$, where $n$ is number of elements in array $a$. Answering a query $K$ can be done by going through the nodes in the desired range and doing a binary search in each node of the segment tree to find the number of values in the node that are at least $K$. This algorithm provides you a time complexity of $\mathcal{O}({\log n}^2)$ per query.

Solution using sqrt decomposition.

You can also apply a sqrt decomposition algorithm for it. Divide the array $a$ into $\sqrt(n)$ buckets. For each bucket, also maintain a sorted list of values in the bucket. When you have to answer a query in the range $[l, r]$, there are three kind of scenarios that can exist. - The buckets that lie completely inside the range. + You can go through each of the buckets and for each bucket do a binary search to find the number of values in the bucket that are at least $K$. Number of suck buckets will be $\sqrt{n}$. - The buckets lie partially in the range. + There can be at most two such buckets, the left and right buckets that intersect with the range $[l, r]$. You can simply iterate over the values of these buckets that lie in the range and find the count of values that are at least $K$. As each bucket is of length $\sqrt(n)$. Its time complexity will still be $\mathcal{O}(\sqrt{n})$.

Time complexity of answering each query is $\mathcal{O}(\sqrt{n} \, \log{n})$.

This can be optimized to $\mathcal{O}(\sqrt{n})$ by making the following observation. As we have to answer queries offline, we are allowed to shuffle or sort them in any way. We sort them w.r.t $K$ values in the increasing order. For each bucket, we maintain a pointer that points to the current value $\leq k$ in the sorted list of values of the bucket. You can notice while processing a query this pointer will always move in the forward direction for a particular bucket. Hence, in the worst case, the above algorithm will give a amortized $\mathcal{O}(\sqrt{n})$ time complexity per query.

SETTER'S SOLUTION

Can be found here.

TESTER'S SOLUTION

Can be found here.

This question is marked "community wiki".

asked 15 Apr '17, 22:08

dpraveen's gravatar image

4★dpraveen ♦♦
2.5k52136170
accept rate: 20%

edited 05 Aug '17, 13:22

admin's gravatar image

0★admin ♦♦
19.7k350498541


Hi Guys!
Made a video editorial for this problem:
Stable Market - CodeChef April Long Challenge

Btw, I think is editorial is quite good, and it explains the approaches quite well. :-)

Cheers!

link

answered 02 May '17, 14:50

gkcs's gravatar image

4★gkcs
2.6k11127
accept rate: 9%

This problem can also be solved using MO's algorithm. see here for solution.

link

answered 17 Apr '17, 18:12

ashwanigautam's gravatar image

3★ashwanigautam
187213
accept rate: 0%

In the alternate view of problem-

Let us create another array B that denotes the lengths of the blocks of the array A. Number of elements in B will be equal to the number of blocks of the array A. For A=[20,10,10,7,7,7,10]the array B will be [1,3,2,1]
.

Should it not be [1,2,3,1]? Am I missing something?

link

answered 23 Apr '17, 13:05

vijju123's gravatar image

4★vijju123 ♦♦
15.2k11860
accept rate: 18%

Yeah, seems like a typo.

(02 May '17, 14:59) abdullah7685★

We can also use the offline method with segment tree.

link

answered 17 Apr '17, 15:46

mathecodician's gravatar image

5★mathecodician
2.6k1930
accept rate: 7%

edited 18 Apr '17, 14:58

Using MO's algorithm and segment tree in O((N+Q)* ROOT(N) * LOG(N)) time ?

(17 Apr '17, 18:13) ashwanigautam3★

No, not using the MO's algo, only segment tree with storing the nodes and the queries together.

(17 Apr '17, 18:20) mathecodician5★

I don't know why you haven't written about it yet :P

(18 Apr '17, 13:42) ashwanigautam3★
(18 Apr '17, 14:58) mathecodician5★

I did everything in O(n+q) by pushing queries into a vector at location l and r and processing with line sweep

Credits to bluescorp for sharing a unique observation, we both had a O(Q*logN) algorithm ready before we communicated. My solution(AC 0.23s): https://www.codechef.com/viewsolution/13370003

Do i get stars now? :D

(21 Apr '17, 21:26) yvetal_13★

i cant delete my comment so im doing this instead ;_; get my solution on the first page plx XD

(21 Apr '17, 22:34) yvetal_13★
showing 5 of 6 show all

I solved using Mo's algorithm and deque data structure.. It was very direct that way.

link

answered 17 Apr '17, 18:13

raja379's gravatar image

4★raja379
892
accept rate: 0%

can some one tell what B array the author is talking about and how to get it? I didn't understood at all.

link

answered 17 Apr '17, 18:37

narendra1897's gravatar image

3★narendra1897
111
accept rate: 0%

can someone tell me why my solution is showing TLE for second Subtask . (I did using Segment trees) ...Time per Query is O(logn) https://www.codechef.com/viewsolution/13332019

link

answered 17 Apr '17, 21:17

sagarpreet07's gravatar image

3★sagarpreet07
1
accept rate: 0%

I solved this using only Mo's Algorithm -> solution

Mo's Algorithm tutorial by Anudeep Nekkanti -> tutorial

link

answered 17 Apr '17, 22:48

dybbuk's gravatar image

3★dybbuk
473
accept rate: 20%

edited 17 Apr '17, 22:56

Square Root Decomposition + Binary Indexed Tree based Solution.

link

answered 18 Apr '17, 09:48

deep1996's gravatar image

5★deep1996
661
accept rate: 16%

I tried doing this using the segment tree, stored the sorted values of the range at each node and then tried searching, still getting TLE in second subtask. Can anyone please tell the reason,i couldnt find the mistake.. https://www.codechef.com/viewsolution/13307321

link

answered 18 Apr '17, 20:14

ambujverma8's gravatar image

4★ambujverma8
1
accept rate: 0%

@narendra1897 in log(n) approach like in first solution array B is nothing but comulative frequency of numbers suppose A[]={1,2,3} so B={1,1,1} A[]={1,1,1}so B[]=(1,2,3} in next case B[]={length of block}

link

answered 19 Apr '17, 09:00

forhadsidhu's gravatar image

2★forhadsidhu
1
accept rate: 0%

I used Segment tree but got TLE in second Subtask. Please,someone tell why is it happening. Here, is link to my code https://www.codechef.com/viewsolution/13312106

link

answered 19 Apr '17, 11:50

yaswant's gravatar image

2★yaswant
1
accept rate: 0%

The solution is accepted using Segment tree where each node is storing the sorted list of elements in the range (this is basically a merge sort tree ). Make sure to use vectors while creating segment tree nodes and don't pass vectors in function but instead pass their references. I got many TLE's because of it. My solution using Segment trees - Solution

(20 Apr '17, 16:25) studyonline3★

i am trying to traverse the array using MO's and keeping track of frequency using segment tree.I don't know what's wrong with my code b/c i tried and run my code on many testcases but thr result are perfectly fine.Can anyone help me find bug in my code link to solution->link text

link

answered 19 Apr '17, 12:33

fane_faiz's gravatar image

3★fane_faiz
1
accept rate: 0%

This part "For each number x in the array, you can create a sorted list of indices where x appears. After that finding number of elements in the range [L,R], will require you to do two binary searches to find the smallest index ≥L and largest index ≤R". What does it mean? Can someone please explain? I have created a segment tree where each node contains sorted scores from left to right of that segment. How do I compute the the correct number of elements between L and R? Thanks.

link

answered 21 Apr '17, 20:40

prokopton's gravatar image

0★prokopton
11
accept rate: 0%

edited 22 Apr '17, 15:50

I did everything in O(n+q) by pushing queries into a vector at location l and r and processing with line sweep

Credits to bluescorp for sharing a unique observation, we both had a O(Q*logN) algorithm ready before we communicated. My solution(AC 0.23s): https://www.codechef.com/viewsolution/13370003

link

answered 24 Apr '17, 20:54

yvetal_1's gravatar image

3★yvetal_1
11
accept rate: 0%

Can someone explain the comments in code written by author @pkacprzak.
How did he decomposed query by calling (solve-strictly-inside8block) && (solve-left-boundary)
Thanks in advance !

link

answered 19 Dec '17, 14:33

swamicoder's gravatar image

4★swamicoder
2347
accept rate: 10%

toggle preview
Preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here

By RSS:

Answers

Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text](http://url.com/ "title")
  • image?![alt text](/path/img.jpg "title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported
  • mathemetical formulas in Latex between $ symbol

Question tags:

×15,498
×2,559
×1,727
×124
×10

question asked: 15 Apr '17, 22:08

question was seen: 4,611 times

last updated: 19 Dec '17, 14:33