SUBLCM - editorial

devuy11 · September 22, 2014, 12:14am

PROBLEM LINK:

[Practice][1]
[Contest][2]
Author: [Lalit Kundu][3]
Tester: [Tasnim Imran Sunny][4]
Editorialist: [Devendra Agarwal][5]

DIFFICULTY:

Medium

PREREQUISITES:

LCM and Dynamic Programming

PROBLEM:

Given an array A1,A2…AN you have to print the size of the largest contiguous subarray such that LCM of all integers in that subarray is equal to the product of all integers in that subarray.

Solution:

Problem Idea: Calculating efficiently longest subarray which ends at i for each i. LCM of all integers in that subarray is equal to the product of all integers in that subarray.

Let Dp[i] denotes the length of the longest subarray ending at i and satisfying the property that the product of all integers of the subarray is LCM of the subarray.

If we know Dp[i-1] , can we calculate Dp[i] ?

Yes , Dp[i] = min ( Dp[i-1]+1 , i - Cal(i) ) (Array indexing starts from 1) , where Cal(i) calculates the last index encountered which is not co-prime to A[i] while travelling left from i. If no such index is found , then 0 is returned.

Let A[] = 2 3 5 4 9 5 10 , then Cal(i) for i=1,…7 will be 0,0,0,1,2,3,6.

Dp[] = 1 2 3 3 3 3 1

Ans=max(Dp)=3

How to calculate the Cal(i) efficiently ?

A number is not co-prime to another number if there exist a common gcd [ or a common prime factor ].

Ok, then we need to factorise A[i+1] , let say we wrote A[i+1] = p₁^e₁ * p₂^e₂ * p₃^e₃ * … * p_k^e_k , where p_i is prime number.

Now we need to pick up the latest index where any of these factors came.

But how can we do it efficiently ?

Look at the constraints and you will notice that A[i] <= 10⁶ , so each prime is less than 10⁶

We can use an array of size 10⁶{let’s call it P[]} to find out the latest position. Look at the pseudo code for Cal(i).

Pseudo Code


	int Cal(int idx)
		int ret_val = 0
		Factors = Factorise(A[idx]) //Factorise will only return prime factors
		for(int i = 0 ;i< Factors.size() ;i++ )	//iterating through all factors
			ret_val = max( ret_val , P[Factors[i]] )  //P is an array of size 10<sup>6</sup>
			P[Factors[i]] = idx			//latest index for prime number Factors[i] is updated to idx.
		return ret_val

What will be our answer ?

Max ( dp[i] ) for all i

Some PitFalls

You need to precompute factors of each numbers [ from 1 to 10⁶ ]. Reason : Calculating it in run time may lead to TLE because :
For factoring any number , you need to iterate over all prime numbers less than sqrt(number). In worst case this number is close to 150. Hence total number of steps will be approximately 150 * 100000 for each Test case. So, for 50 test cases this goes to 75 * 10⁷ which will time out.

Another Way of understanding the Solution
Actually you can also interpret the solution as an application of two pointer method.

Two pointer method
Assume that [i, j] is the your current valid segment/ window. If you go from i to i + 1, and the segment [i + 1, j] still remains a valid segment, then you can apply this method.
So you can consider that when going from left to right, if you maintain a pointer for the right end of the good segment, then the right end of the segment always keeps on increasing.

Basically in other words, you can treat the segment as a window representing a valid solution. You need to support fast addition and deletion of elements from the window. But you will notice that
if your right pointer always increases then you will never need to delete the right end of the segment. You will always delete the left end and will always add the right end of the segment.

It is very easy to prove that complexity of this method is O(n), as that right pointer is always increasing and hence can make at most n iterations, So overall there could be at most n + n iterations over both left and right ends of a good segment which amounts to total O(n) operations. This method is called window two pointer method.

Sometimes this method is called sliding window algorithm too.

How to apply it in our problem
You can notice that if the segment [i, j] is a valid segment (ie. a segment containing numbers having their lcm equal to their product), then the segment [i + 1, j] is also valid.
When we want to extend the right end of our current segment (ie. we are looking at validity of segment [i, j + 1] considering that we know that segment [i, j] is valid), we need to check whether the current number has any common divisor (prime divisor will also work) with any number in the window. This check can be done easily by using the Cal array as explained above.

Overall complexity of this method is O(N).

AUTHOR’S and TESTER’S SOLUTIONS:

[Author’s solution][6]
[Tester’s solution][7]
[1]: SUBLCM Problem - CodeChef
[2]: CodeChef: Practical coding for everyone
[3]: darkshadows | CodeChef User Profile for Lalit Kundu | CodeChef
[4]: rustinpiece | CodeChef User Profile for Tasnim Imran Sunny | CodeChef
[5]: devuy11 | CodeChef User Profile for Devendra Agarwal | CodeChef
[6]: http://www.codechef.com/download/Solutions/COOK50/Setter/SUBLCM.cpp
[7]: http://www.codechef.com/download/Solutions/COOK50/Tester/SUBLCM.cpp

rishul_nsit · September 22, 2014, 12:18am

Extremely strict time limit!! There should not have been any need to pre-compute factors of every number beforehand. Good question ruined only by the imposition of tight time limit!

sikander_nsit · September 22, 2014, 1:06am

I used sieve to calculate the smallest prime factor(spf) of every number less than 10^6. Then I was calculating the prime factors for each number using the method of repeatedly dividing by the spf in a loop until the number is greater than 1. For this loop, most number of iterations will be 20 for 2^20. I used a set to store the prime factors which would balance out as when number of distinct prime factors would be more, the number of loop iterations would be less and vice versa. So the worst case complexity for single test case would be 20*(10^5) = 2*(10^6). Then I was using the two pointer method that was explained above. I got TLE using this approach. I would like to know if there was a test file in which all test cases were identical?

[Edit]

“Yes, there was a file in which all the test cases were identical. It was n = 10^5 and all numbers being 720720.”

I don’t have much experience in problem setting but I don’t think this is a good practice. The primary purpose of including ‘t’ in the problem is that complete testing can be done in less number of files. Usually ‘t’ is not included in the time complexity. Repeating the test case 50 times means you are essentially making n=5*(10^6). So in most problems time limit is such that if the solution can pass the worst test case within the time limit, it should be able to pass all test files. Even when all test cases have the maximum limits, t is usually less than equal to 10. 50 makes the time limit extremely strict. Maybe one of the more experienced problem setters can comment on this.

rahul_nexus · September 22, 2014, 1:48am

Shouldn’t this be Dp[i] = max(Dp[i-1]+1 , i+1 - Cal(i) ) instead of Dp[i] = min ( Dp[i-1]+1 , i+1 - Cal(i) )

betlista · September 22, 2014, 1:54am

I did very the same as @sikander_nsit described, any generator for a worst case ?

My solution is here - CodeChef: Practical coding for everyone complexity in comments…

I used map to represents polynomial, for example 18 = 2 * 3 * 3 = 2 * 3² => { 2=1, 3=2 }

shalini11 · September 22, 2014, 4:33am

To Editorialist : Some accepted solutions are giving wrong answer for case :
2 46093 92186 .
Please consider this case .

cbgiri_001 · September 22, 2014, 10:23am

Please anyone expalin how is this working
Dp[i] = min ( Dp[i-1]+1 , i+1 - Cal(i) )

akumar3 · September 22, 2014, 11:16am

Kudos to Tester’s solution…very readable!
Thanks…

jaskaran_1 · September 27, 2014, 4:03pm

Could somebody explain me this in the Tester’s solution:

if(i)
       {
           best[i] = MAX(best[i-1],x);
           ans=MAX(ans,i-best[i]);
       }
       else
         best[i]=-1;
   }

And how does it function similar to

dp[i]=min(dp[i-1]+1,i-Cal(i))

caioaao · October 14, 2014, 8:54pm

I’m getting WA and have no idea why. Here is my submission:
www.codechef.com/viewsolution/5149059

Can someone help me/provide tricky test cases? I wasn’t able to find any;

gchandel6 · May 3, 2015, 11:23am

Easy with sliding window algorithm…(or kadanes algorithm)

georgejoseph · September 23, 2015, 3:56pm

Why can’t we use normal gcd function in Cal(i)?
I agree it increase the time.But its still in limit,right?

spinxo · November 24, 2015, 9:38pm

a really nice problem

lakshmi8 · January 25, 2016, 8:59pm

Great problem and a great editorial _/\_

rishul_nsit · September 22, 2014, 12:19am

solution links broken…

dpraveen · September 22, 2014, 12:22am

You don’t need to compute all the factors, You can only work with prime factors.

rishul_nsit · September 22, 2014, 12:26am

I didn’t mean all factors. Prime factorization of every number could be calculated on the fly instead of pre-computation. Nevertheless learnt something from the question so no qualms
PS: There is no python solution which clearly given an idea about the strict time limit.

dpraveen · September 22, 2014, 12:30am

You can compute on the fly but it’s time complexity would be different.

rishul_nsit · September 22, 2014, 12:46am

Can you provide a worst case for computing on the fly? Provided I cache them as and when I calculate them. That is I don’t compute the factors of any number more than once.

dpraveen · September 22, 2014, 1:29am

Yes, there was a file in which all the test cases were identical. It was n = 10^5 and all numbers being 720720.