EQUALHAMMING - Editorial

iceknight1093 · January 25, 2023, 4:14am

PROBLEM LINK:

Practice
Contest: Division 1
Contest: Division 2
Contest: Division 3
Contest: Division 4

Author: utkarsh_25dec
Testers: iceknight1093, rivalq
Editorialist: iceknight1093

DIFFICULTY:

TBD

PREREQUISITES:

Basic combinatorics

PROBLEM:

Given two binary strings A and B of length N, count the number of binary strings C of length N such that \text{dist}(A, C) = \text{dist}(B, C).
\text{dist}(A, C) denotes the hamming distance between A and C.

EXPLANATION:

Hamming distance is computed as the sum of N individual terms, one for each index. So, let’s see how each index affects the equality.

Consider an index i.

If A_i = B_i, then this index either contributes 0 to both \text{dist}(A, C) and \text{dist}(B, C) (if C_i = A_i); or contributes 1 to both (if C_i \neq A_i).
In other words, it doesn’t affect the equality at all, so we can freely choose C_i = 0 or C_i = 1 here.
If A_i \neq B_i, then depending on our choice of C_i, this index either contributes 1 to \text{dist}(A, C) and 0 to \text{dist}(B, C); or vice versa.
Clearly, for the equality to hold in the end, the number of such positions that contribute 1 to \text{dist}(A, C) must equal the number that contribute 1 to \text{dist}(B, C).

So, suppose there are K positions such that A_i \neq B_i; and N - K positions where they’re equal.
If K is odd, the answer is immediately 0, since as we noted we need to split these K positions equally. Let’s deal with even K now.

At \frac{K}{2} of the unequal positions, we must fix C_i to ensure that C_i \neq A_i.
At the other \frac{K}{2} of the unequal positions, we must fix C_i to ensure that C_i \neq B_i.
The remaining N-K positions are ‘free’, and each has two options.

There are \binom{K}{K/2} ways to choose K/2 positions out of K, and then the values at these K positions are fixed.
The other positions have 2^{N-K} options in total.
Multiplying everything together, the final answer is

\binom{K}{K/2} \times 2^{N-K}
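
As a sanity check, the formula can be compared against brute force on small inputs. This is only a sketch and not one of the official solutions: the function names count_formula and count_brute are made up for illustration, and exact integers are used instead of a modulus since the inputs are tiny.

```python
from itertools import product
from math import comb


def count_formula(a: str, b: str) -> int:
    """Closed form: C(K, K/2) * 2^(N-K), or 0 when K is odd."""
    n = len(a)
    k = sum(x != y for x, y in zip(a, b))  # number of positions where A and B differ
    if k % 2 == 1:
        return 0
    return comb(k, k // 2) * 2 ** (n - k)


def count_brute(a: str, b: str) -> int:
    """Try every binary string C and compare the two Hamming distances."""
    total = 0
    for c in product("01", repeat=len(a)):
        dist_a = sum(x != y for x, y in zip(a, c))
        dist_b = sum(x != y for x, y in zip(b, c))
        total += dist_a == dist_b
    return total


# Compare both counts on every pair of strings of length 1 to 4.
for n in range(1, 5):
    for a in product("01", repeat=n):
        for b in product("01", repeat=n):
            sa, sb = "".join(a), "".join(b)
            assert count_formula(sa, sb) == count_brute(sa, sb)
```

The brute force is exponential in N, so it is only useful for testing, never for the actual constraints.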

Computing binomial coefficients under a modulus requires modular division, i.e. multiplying by modular inverses instead of dividing directly.
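
One common way to do this is to precompute factorials and inverse factorials modulo the prime. The sketch below is an illustration rather than the setter's exact method: it takes a single inverse via Fermat's little theorem and fills the inverse factorials with a backward pass, whereas the setter's code uses a recurrence for inv[i]. The names build_tables and binom, and the table size of 20, are assumptions chosen for the example.

```python
MOD = 10**9 + 7  # prime modulus used in this problem


def build_tables(limit: int):
    """Return (fact, inv_fact) where fact[i] = i! and inv_fact[i] = (i!)^(-1), both mod MOD."""
    fact = [1] * (limit + 1)
    for i in range(1, limit + 1):
        fact[i] = fact[i - 1] * i % MOD
    inv_fact = [1] * (limit + 1)
    # Fermat's little theorem: x^(MOD-2) is the inverse of x modulo a prime MOD.
    inv_fact[limit] = pow(fact[limit], MOD - 2, MOD)
    # 1/(i-1)! = (1/i!) * i, so walk backwards from the top.
    for i in range(limit, 0, -1):
        inv_fact[i - 1] = inv_fact[i] * i % MOD
    return fact, inv_fact


def binom(n: int, r: int, fact, inv_fact) -> int:
    """n choose r modulo MOD, reducing after every multiplication."""
    if r < 0 or r > n:
        return 0
    return fact[n] * inv_fact[r] % MOD * inv_fact[n - r] % MOD


fact, inv_fact = build_tables(20)  # small table, just for the example
```

For the real constraints the table would need to cover the largest possible K, which is at most N.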

TIME COMPLEXITY:

\mathcal{O}(N) per testcase.

CODE:

Setter's code (C++)

//Utkarsh.25dec
#include <iostream>
#include <cstdio>
#include <cstdlib>
#include <algorithm>
#include <cmath>
#include <vector>
#include <set>
#include <map>
#include <unordered_set>
#include <unordered_map>
#include <queue>
#include <ctime>
#include <cassert>
#include <complex>
#include <string>
#include <cstring>
#include <chrono>
#include <random>
#include <bitset>
#include <array>
#define ll long long int
#define pb push_back
#define mp make_pair
#define mod 1000000007
#define vl vector <ll>
#define all(c) (c).begin(),(c).end()
using namespace std;
ll power(ll a,ll b) {ll res=1;a%=mod; assert(b>=0); for(;b;b>>=1){if(b&1)res=res*a%mod;a=a*a%mod;}return res;}
ll modInverse(ll a){return power(a,mod-2);}
const int N=500023;
bool vis[N];
vector <int> adj[N];
long long readInt(long long l,long long r,char endd){
    long long x=0;
    int cnt=0;
    int fi=-1;
    bool is_neg=false;
    while(true){
        char g=getchar();
        if(g=='-'){
            assert(fi==-1);
            is_neg=true;
            continue;
        }
        if('0'<=g && g<='9'){
            x*=10;
            x+=g-'0';
            if(cnt==0){
                fi=g-'0';
            }
            cnt++;
            assert(fi!=0 || cnt==1);
            assert(fi!=0 || is_neg==false);

            assert(!(cnt>19 || ( cnt==19 && fi>1) ));
        } else if(g==endd){
            if(is_neg){
                x= -x;
            }

            if(!(l <= x && x <= r))
            {
                cerr << l << ' ' << r << ' ' << x << '\n';
                assert(1 == 0);
            }

            return x;
        } else {
            assert(false);
        }
    }
}
string readString(int l,int r,char endd){
    string ret="";
    int cnt=0;
    while(true){
        char g=getchar();
        assert(g!=-1);
        if(g==endd){
            break;
        }
        cnt++;
        ret+=g;
    }
    assert(l<=cnt && cnt<=r);
    return ret;
}
long long readIntSp(long long l,long long r){
    return readInt(l,r,' ');
}
long long readIntLn(long long l,long long r){
    return readInt(l,r,'\n');
}
string readStringLn(int l,int r){
    return readString(l,r,'\n');
}
string readStringSp(int l,int r){
    return readString(l,r,' ');
}
int sumN=0;
ll fact[N];
ll invfact[N];
ll inv[N];
void factorialsComputation()
{
    inv[0]=inv[1]=1;
    fact[0]=fact[1]=1;
    invfact[0]=invfact[1]=1;
    for(int i=2;i<N;i++)
    {
        inv[i]=(inv[mod%i]*(mod-mod/i))%mod;
        fact[i]=(fact[i-1]*i)%mod;
        invfact[i]=(invfact[i-1]*inv[i])%mod;
    }
}
ll ncr(ll n,ll r)
{
    ll ans=fact[n]*invfact[r];
    ans%=mod;
    ans*=invfact[n-r];
    ans%=mod;
    return ans;
}
void solve()
{
    int n=readInt(1,200000,'\n');
    sumN+=n;
    assert(sumN<=200000);
    string A=readString(n,n,'\n');
    string B=readString(n,n,'\n');
    int good=0,bad=0;
    for(int i=0;i<n;i++)
    {
        assert(A[i]=='0' || A[i]=='1');
        assert(B[i]=='0' || B[i]=='1');
        if(A[i]==B[i])
            good++;
        else
            bad++;
    }
    if(bad%2==1)
    {
        cout<<0<<'\n';
        return;
    }
    ll ans=power(2,good)*ncr(bad,bad/2);
    ans%=mod;
    cout<<ans<<'\n';
}
int main()
{
    #ifndef ONLINE_JUDGE
    freopen("input.txt", "r", stdin);
    freopen("output.txt", "w", stdout);
    #endif
    ios_base::sync_with_stdio(false);
    cin.tie(NULL),cout.tie(NULL);
    factorialsComputation();
    int T=readInt(1,1000,'\n');
    while(T--)
        solve();
    assert(getchar()==-1);
    cerr << "Time : " << 1000 * ((double)clock()) / (double)CLOCKS_PER_SEC << "ms\n";
}

Tester's code (C++)

// Jai Shree Ram  
  
#include<bits/stdc++.h>
using namespace std;

#define rep(i,a,n)     for(int i=a;i<n;i++)
#define ll             long long
#define int            long long
#define pb             push_back
#define all(v)         v.begin(),v.end()
#define endl           "\n"
#define x              first
#define y              second
#define gcd(a,b)       __gcd(a,b)
#define mem1(a)        memset(a,-1,sizeof(a))
#define mem0(a)        memset(a,0,sizeof(a))
#define sz(a)          (int)a.size()
#define pii            pair<int,int>
#define hell           1000000007
#define elasped_time   1.0 * clock() / CLOCKS_PER_SEC



template<typename T1,typename T2>istream& operator>>(istream& in,pair<T1,T2> &a){in>>a.x>>a.y;return in;}
template<typename T1,typename T2>ostream& operator<<(ostream& out,pair<T1,T2> a){out<<a.x<<" "<<a.y;return out;}
template<typename T,typename T1>T maxs(T &a,T1 b){if(b>a)a=b;return a;}
template<typename T,typename T1>T mins(T &a,T1 b){if(b<a)a=b;return a;}

// -------------------- Input Checker Start --------------------
 
long long readInt(long long l, long long r, char endd)
{
    long long x = 0;
    int cnt = 0, fi = -1;
    bool is_neg = false;
    while(true)
    {
        char g = getchar();
        if(g == '-')
        {
            assert(fi == -1);
            is_neg = true;
            continue;
        }
        if('0' <= g && g <= '9')
        {
            x *= 10;
            x += g - '0';
            if(cnt == 0)
                fi = g - '0';
            cnt++;
            assert(fi != 0 || cnt == 1);
            assert(fi != 0 || is_neg == false);
            assert(!(cnt > 19 || (cnt == 19 && fi > 1)));
        }
        else if(g == endd)
        {
            if(is_neg)
                x = -x;
            if(!(l <= x && x <= r))
            {
                cerr << l << ' ' << r << ' ' << x << '\n';
                assert(false);
            }
            return x;
        }
        else
        {
            assert(false);
        }
    }
}
 
string readString(int l, int r, char endd)
{
    string ret = "";
    int cnt = 0;
    while(true)
    {
        char g = getchar();
        assert(g != -1);
        if(g == endd)
            break;
        cnt++;
        ret += g;
    }
    assert(l <= cnt && cnt <= r);
    return ret;
}
 
long long readIntSp(long long l, long long r) { return readInt(l, r, ' '); }
long long readIntLn(long long l, long long r) { return readInt(l, r, '\n'); }
string readStringLn(int l, int r) { return readString(l, r, '\n'); }
string readStringSp(int l, int r) { return readString(l, r, ' '); }
void readEOF() { assert(getchar() == EOF); }
 
vector<int> readVectorInt(int n, long long l, long long r)
{
    vector<int> a(n);
    for(int i = 0; i < n - 1; i++)
        a[i] = readIntSp(l, r);
    a[n - 1] = readIntLn(l, r);
    return a;
}
 
// -------------------- Input Checker End --------------------

const int MOD = hell;
 
struct mod_int {
    int val;
 
    mod_int(long long v = 0) {
        if (v < 0)
            v = v % MOD + MOD;
 
        if (v >= MOD)
            v %= MOD;
 
        val = v;
    }
 
    static int mod_inv(int a, int m = MOD) {
        int g = m, r = a, x = 0, y = 1;
 
        while (r != 0) {
            int q = g / r;
            g %= r; swap(g, r);
            x -= q * y; swap(x, y);
        }
 
        return x < 0 ? x + m : x;
    }
 
    explicit operator int() const {
        return val;
    }
 
    mod_int& operator+=(const mod_int &other) {
        val += other.val;
        if (val >= MOD) val -= MOD;
        return *this;
    }
 
    mod_int& operator-=(const mod_int &other) {
        val -= other.val;
        if (val < 0) val += MOD;
        return *this;
    }
 
    static unsigned fast_mod(uint64_t x, unsigned m = MOD) {
           #if !defined(_WIN32) || defined(_WIN64)
                return x % m;
           #endif
           unsigned x_high = x >> 32, x_low = (unsigned) x;
           unsigned quot, rem;
           asm("divl %4\n"
            : "=a" (quot), "=d" (rem)
            : "d" (x_high), "a" (x_low), "r" (m));
           return rem;
    }
 
    mod_int& operator*=(const mod_int &other) {
        val = fast_mod((uint64_t) val * other.val);
        return *this;
    }
 
    mod_int& operator/=(const mod_int &other) {
        return *this *= other.inv();
    }
 
    friend mod_int operator+(const mod_int &a, const mod_int &b) { return mod_int(a) += b; }
    friend mod_int operator-(const mod_int &a, const mod_int &b) { return mod_int(a) -= b; }
    friend mod_int operator*(const mod_int &a, const mod_int &b) { return mod_int(a) *= b; }
    friend mod_int operator/(const mod_int &a, const mod_int &b) { return mod_int(a) /= b; }
 
    mod_int& operator++() {
        val = val == MOD - 1 ? 0 : val + 1;
        return *this;
    }
 
    mod_int& operator--() {
        val = val == 0 ? MOD - 1 : val - 1;
        return *this;
    }
 
    mod_int operator++(int32_t) { mod_int before = *this; ++*this; return before; }
    mod_int operator--(int32_t) { mod_int before = *this; --*this; return before; }
 
    mod_int operator-() const {
        return val == 0 ? 0 : MOD - val;
    }
 
    bool operator==(const mod_int &other) const { return val == other.val; }
    bool operator!=(const mod_int &other) const { return val != other.val; }
 
    mod_int inv() const {
        return mod_inv(val);
    }
 
    mod_int pow(long long p) const {
        assert(p >= 0);
        mod_int a = *this, result = 1;
 
        while (p > 0) {
            if (p & 1)
                result *= a;
 
            a *= a;
            p >>= 1;
        }
 
        return result;
    }
 
    friend ostream& operator<<(ostream &stream, const mod_int &m) {
        return stream << m.val;
    }
    friend istream& operator >> (istream &stream, mod_int &m) {
        return stream>>m.val;   
    }
};
#define NCR
const int N=1e6;
mod_int fact[N],inv[N];
void init(int n=N){
	fact[0]=inv[0]=inv[1]=1;
	rep(i,1,N)fact[i]=i*fact[i-1];
	rep(i,2,N)inv[i]=fact[i].inv();
}
mod_int C(int n,int r){
	if(r>n || r<0)return 0;
	return fact[n]*inv[n-r]*inv[r];
}

int solve(){
 		
               int n = readIntLn(1, 2e5);
               static int sum_n = 0;
               sum_n += n;

               assert(sum_n <= 2e5);

               string a = readStringLn(n, n);
               string b = readStringLn(n,n);
               for(auto &i: a){
                        assert(i == '0' or i == '1');
               }
               for(auto &i: b){
                        assert(i == '1' or i == '0');
               }
               int cnt = 0;

               for(int i = 0; i < n; i++){
               		cnt += a[i] != b[i];
               }
               if(cnt & 1){
               		cout << "0" << endl;
               		return 0;
               }
               cout << mod_int(2).pow(n - cnt) * C(cnt, cnt / 2) << endl;


                

 return 0;
}
signed main(){
    ios_base::sync_with_stdio(0);cin.tie(0);cout.tie(0);
    //freopen("input.txt", "r", stdin);
    //freopen("output.txt", "w", stdout);
    #ifdef SIEVE
    sieve();
    #endif
    #ifdef NCR
    init();
    #endif
    int t = readIntLn(1, 1000);
    while(t--){
        solve();
    }
    return 0;
}

Editorialist's code (Python)

mod = 10**9 + 7
def C(n, r):
    ret = 1
    for i in range(1, r+1):
        ret = ret * (n-i+1) * pow(i, mod-2, mod)
        ret %= mod
    return ret
for _ in range(int(input())):
    n = int(input())
    a = input()
    b = input()
    differ = 0
    for i in range(n):
        differ += a[i] != b[i]
    if differ%2 == 1: print(0)
    else: print(pow(2, n - differ, mod) * C(differ, differ//2) % mod)

tomb_123 · January 26, 2023, 5:44pm

so sad, I was applying kP(k/2) instead of kC(k/2). High time to improve my combination and permutation game. I always fall short there.

neel_04 · January 30, 2023, 3:09pm

Humbly asking the setters, testers and editorialists not to put a ton of macros in the code; it doesn’t look nice and makes the code difficult to read.

neel_04 · January 30, 2023, 3:10pm

Btw, I can’t figure out where my code is wrong. Can anyone check?

iceknight1093 · January 30, 2023, 4:09pm

The setter and testers will generally be focused on preparing the contest, and can’t be expected to code differently from their usual style. They also check the strength and validity of the testdata so there may be various asserts through their code, can’t do much about that.
As the editorialist, I never use macros anyway so there’ll always be at least one ‘clean’ implementation.

In this particular case I think the setter’s code is quite readable, if you just ignore the input template.

It’s generally not possible to debug when you only show a small part of the code, please make a submission to codechef or upload it to a site like https://p.ip.fi/ and share a link.

neel_04 · January 31, 2023, 5:05am

ok
This is the submission I made; it’s not working for a[i]!=b[i].
https://www.codechef.com/viewsolution/87438254

iceknight1093 · January 31, 2023, 5:31am

Three mistakes:

All the factorial/inverse factorial values are 0 because you forgot to call the factorial() function to initialize them
The inverse factorials are wrong: you’re computing invfact[i] = invfact[i-1] * i, but that just gives the factorials again
The nCr function has the line ans%-mod, it should be ans%=mod

It’s actually not hard to debug these mistakes if you do it systematically.

Notice that whenever you use factorials/inverse factorials, the answer ends up being 0. This should tell you that something is wrong with that part of the code, so if you had printed out their first 10 values or so, you’d have noticed that they’re all 0, which means they haven’t been computed.
Wrong computation of inverse factorials also would’ve been caught by this: if you printed out the first few values you’d have noticed that the factorials and inverse factorials were equal which is of course wrong.
The third mistake is harder to catch by yourself, but is something your compiler can catch for you if you set it up correctly. When I compiled your code I got the warning

/Code/Test.cpp:179:8: warning: value computed is not used [-Wunused-value]
  179 |     ans%-mod;

which is how I even noticed it. I recommend going through this blog, it’s quite useful.
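
The second check above can also be automated. The following is a small sketch, not the submission itself: the buggy loop is a reconstruction of the recurrence described above, and is_consistent is a hypothetical helper that tests whether each factorial times its claimed inverse is 1 modulo the prime.

```python
MOD = 10**9 + 7
LIMIT = 10  # the first few values are enough to expose the bug

fact = [1] * (LIMIT + 1)
for i in range(1, LIMIT + 1):
    fact[i] = fact[i - 1] * i % MOD

# Buggy recurrence as described above: this just recomputes the factorials.
buggy_invfact = [1] * (LIMIT + 1)
for i in range(1, LIMIT + 1):
    buggy_invfact[i] = buggy_invfact[i - 1] * i % MOD

# One correct recurrence: multiply by the modular inverse of i instead of i.
invfact = [1] * (LIMIT + 1)
for i in range(1, LIMIT + 1):
    invfact[i] = invfact[i - 1] * pow(i, MOD - 2, MOD) % MOD


def is_consistent(f, inv_f):
    """True when f[i] * inv_f[i] is 1 modulo MOD for every index."""
    return all(x * y % MOD == 1 for x, y in zip(f, inv_f))
```

Calling is_consistent(fact, buggy_invfact) returns False, while is_consistent(fact, invfact) returns True.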

neel_04 · January 31, 2023, 5:48am

thanks a lot

neel_04 · January 31, 2023, 6:26am

Can you check what is wrong? It’s giving AC for 4 test cases but WA on all the others.
Solution: 87442911 | CodeChef

iceknight1093 · January 31, 2023, 7:54am

ll ans = (one*two*three)%mod; has overflow because all three can be up to 10^9, so the product exceeds even the range of long long. Multiply two at a time, reducing modulo mod after each multiplication.

neel_04 · January 31, 2023, 11:21am

Still isn’t passing all the test cases
https://www.codechef.com/viewsolution/87471086

iceknight1093 · January 31, 2023, 12:09pm

I think you need to be more careful with overflow.

ll ans = ((one%mod)*(two%mod))%mod;
ans *= (three%mod);

return ans;

After the first step, ans is less than mod.
In the second step, you’re multiplying it with something that’s also less than mod. But what about the result?

Edit: In fact, this is one of the main reasons people use a mod_int class: if you write it correctly once, you can abstract out all this stuff and only directly write the operations. You never have to write a %mod again.

You can see a simple example of this in the tester’s code: the main logic consists of purely additions/multiplications, all the mod operations are abstracted away.
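
To give a rough idea of the pattern, here is a minimal sketch of such a class. It is written in Python only for brevity, where integers never overflow, so it shows the abstraction rather than the overflow protection; the class name ModInt and its small set of operators are hypothetical and much smaller than the tester’s mod_int.

```python
MOD = 10**9 + 7


class ModInt:
    """Hypothetical minimal wrapper: the stored value is always in [0, MOD)."""

    def __init__(self, value=0):
        self.val = int(value) % MOD

    def __int__(self):
        return self.val

    def __add__(self, other):
        return ModInt(self.val + int(other))

    def __mul__(self, other):
        return ModInt(self.val * int(other))

    def __pow__(self, exponent):
        # Built-in three-argument pow does fast modular exponentiation.
        return ModInt(pow(self.val, exponent, MOD))

    def __repr__(self):
        return f"ModInt({self.val})"


# The main logic reads as plain arithmetic, with no explicit "% MOD".
answer = ModInt(2) ** 3 * ModInt(6)
```

Every operator reduces its result in the constructor, so the reduction is written once and cannot be forgotten at a call site.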

neel_04 · January 31, 2023, 12:29pm

But still, if it didn’t overflow, how come it gave a wrong answer?
In the submissions section I found this one, and it gave the right answer?
https://www.codechef.com/viewsolution/87017540

iceknight1093 · January 31, 2023, 2:59pm

I’m trying to tell you your code does have overflow…

Try to follow the process of the program.

ll x = nCr(diff, diff/2, fact);
ll p = Modpower(2,equal);
ans = (x*p)%mod;

Here you’re multiplying x and p.
x comes from the nCr function, p comes from the Modpower function.

The Modpower function is ok, no issue there.
But, the nCr function has the three lines I pointed out in my previous comment:

ll ans = ((one%mod)*(two%mod))%mod;
ans *= (three%mod);
return ans;

The final value of ans is not modded!!! You’re multiplying two things and just returning that product. ans can be as large as 10^{18}.

This means the product (x*p) can be as large as 10^{18} \times 10^9. See the problem?

The fix is extremely simple, just return ans%mod instead of ans and you’ll get AC.

The last code you linked has no overflow issues because they use the mul function specifically to ensure that, of course it’s correct.
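
The sizes involved can be checked with a few lines. This is just an illustration in Python, whose integers never overflow, so it compares magnitudes against the signed 64-bit limit instead of reproducing the overflow itself; INT64_MAX and mul3 are names made up for this sketch.

```python
MOD = 10**9 + 7
INT64_MAX = 2**63 - 1  # largest value a signed 64-bit long long can hold

worst = MOD - 1  # largest possible residue

# Two residues multiplied together still fit in a signed 64-bit integer...
fits_two = worst * worst <= INT64_MAX
# ...but three residues multiplied without reducing in between do not.
fits_three = worst * worst * worst <= INT64_MAX


def mul3(one: int, two: int, three: int) -> int:
    """Multiply two at a time, reducing after each step, and return a reduced value."""
    ans = one % MOD * (two % MOD) % MOD
    ans = ans * (three % MOD) % MOD
    return ans
```

Here fits_two is True and fits_three is False, which is why every intermediate product has to be reduced before the next multiplication.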

neel_04 · January 31, 2023, 3:10pm

Thanks a lot, finally got it, feels like i am not improving at all
Thanks a lot for clearing my doubt

eleric_123 · February 12, 2023, 8:11am

Guys, can anyone help me understand why we have to use inverse multiplication when we use a large prime number as the modulus, rather than regular division? I tried submitting it the classical way, but it’s giving me wrong answer.
Here is the code:
https://www.codechef.com/viewsolution/89097817

iceknight1093 · February 12, 2023, 4:40pm

You need inverses because normal division makes absolutely no sense when dealing with modulos.

Take a simple example.
Let P = 10^9 + 7, the mod you see everywhere.

1 and 10^9 + 8 are the same thing modulo P, right?
What about \frac{1}{2} and \frac{10^9 + 8}{2}? Those are obviously not equal modulo P if you do normal division, right?
But they represent the same number modulo P, so dividing both by 2 should ideally also represent the same number modulo P — that’s generally what equality means.

Modular inverses are what allow you to keep this property of equality after division.
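
The same example can be run directly. This is a small sketch that uses integer (floor) division as a stand-in for ‘normal’ division in code; the variable names are chosen just for the illustration, and the inverse of 2 is obtained with Fermat’s little theorem, which relies on P being prime.

```python
P = 10**9 + 7

inv2 = pow(2, P - 2, P)  # modular inverse of 2, valid because P is prime

x = 1
y = 10**9 + 8  # same residue as 1 modulo P

# Ordinary integer division breaks the equality...
plain_x = x // 2
plain_y = y // 2
# ...while multiplying by the inverse preserves it.
modular_x = x * inv2 % P
modular_y = y * inv2 % P
```

Here plain_x and plain_y differ modulo P, while modular_x and modular_y are both 500000004, the number that gives 1 when multiplied by 2 modulo P.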

eleric_123 · February 17, 2023, 11:55am

Thanks bro, I am totally new to mod inverse but the explanation made it clear