Finding group of columns with row sum equal one

Question

0.00/5 (No votes)

See more:

I have a 2D array with ones and zeros, for example:

1 0 0 0
0 1 0 0
1 0 1 0
1 0 0 1

I need to find if there is a group of columns (or a single column) that have a row sum equal 1. For the above array the answer is YES because:

1 + 0 = 1
0 + 1 = 1
1 + 0 = 1
1 + 0 = 1

Edit:
This solution will work for much bigger arrays (for example 900x600). That's why I'm trying to avoid a brute force solution.

What I have tried:

Looking for a solution online and trying to find to come up with a better solution than brute forcing it.

Posted 10-Mar-22 5:51am

Member 15561967

Updated 10-Mar-22 8:08am

v2

Add a Solution

Comments

PIEBALDconsult 10-Mar-22 11:53am

More detail is required.

Member 15561967 10-Mar-22 12:00pm

What other detail do you need? I'd be happy to add it to the original post.

jeron1 10-Mar-22 11:56am

I would initially 'brute force' it, then try and optimize only if necessary. If you have specific questions during that process, feel free to ask them here.

Member 15561967 10-Mar-22 13:20pm

Brute force will be very slow for a bigger array. That's what I'm trying to avoid.

Rick York 10-Mar-22 12:16pm

Brute force is the appropriate thing to try in my opinion. There appear to be only eight possibilities so it's really not so bad. That is, four columns combined two at a time.

Member 15561967 10-Mar-22 13:22pm

Brute force will be very slow for a bigger array. That's what I'm trying to avoid. I've updated the question.

1 solution

Add a Solution

Add your solution here

Treat my content as plain text, not as HTML

Preview 0

…

Existing Members

Sign in to your account

...or Join us

Download, Vote, Comment, Publish.

Your Email
Password
Forgot your password?

Your Email
This email is in use. Do you need your password?
Optional Password

I have read and agree to the Terms of Service and Privacy Policy
Please subscribe me to the CodeProject newsletters

When answering a question please:

Read the question carefully.
Understand that English isn't everyone's first language so be lenient of bad spelling and grammar.
If a question is poorly phrased then either ask for clarification, ignore it, or edit the question and fix the problem. Insults are not welcome.
Don't tell someone to read the manual. Chances are they have and don't get it. Provide an answer or move on to the next question.

Let's work to help developers, not make them feel stupid.

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

Luc Pattyn · Accepted Answer · 2022-03-10T08:08:00

Aha, That is a special case of a more general optimisation job I've spent many years on; I ended up with a very nice approach that is the opposite of brute force, and provides all solutions in one go.

Applied on your matrix situation, I'd like to describe it using boolean logic, i.e. all variables will have values 0 or 1, and the operators are AND (written as .), OR (written as +), and invert (written as /). Here we go:

call your columns A, B, C, D, ...
introduce variables a, b, c, d, indicating whether the corresponding column is or isn't used in the solution.

Now start building a, possibly huge, boolean expression TARGET, using the following pseudocode:

pseudo

TARGET=1
foreach row {
    expr = some expression describing the row (explained below)
    TARGET = TARGET . expr
}
now reduce the TARGET expression using three facts:
step 1 = explode: x.(y+z) => x.y + x.z
step 2 = discard zeroes: x./x => 0
step 3 = power reduction: x.x => x

you can merge these steps (i.e. perform steps 2 and 3 while executing step 1), for clarity I will show them separately in the above order

what remains is the list of all solutions, pick any one of the terms and make it 1.
Depending on the matrix content, you will get zero, one or several solutions.

Applied to your example:

TARGET= 1        // for starters
.a               // row 1 says you need a
.b               // row 2 says you need b
.(a./c + /a.c)   // row 3 says you need either a or c but not both
.(a./d + /a.d)   // similar for row 4

exploded gives four terms as there are two groups of 2 terms in the expression:
TARGET=a.b.a./c.a./d + a.b./a.c.a./d + a.b.a./c./a.d + a.b./a.c./a.d

zero discarding will drop the last three products as they all contain a./a
power reduction of the remaining product yields:

TARGET=a.b./c./d

which means you must include A and B and must not include C and D.

For more complex problems, a typical final expression could be:

TARGET = a.b./c./d + a./b./c.d.e + /a.b.c./d.e

remarks:

1. when processing (inside the foreach loop) a row that holds N ones, you need N products; each product would have one positive column and N-1 inverted columns. So if there were a fifth row with "1 0 1 1" that would contribute (a./c./d + /a.c./d + /a./c.d)

2. when the final TARGET would hold S products, that indicates there are S solutions; any combination of variables that makes one product true or one, is a solution (no matter what the other products contribute).

3. when a product in the final TARGET does not hold all column variables, that means the missing columns are don't care, which in your application means those columns contain all zeroes.

4. if all-zero rows are present, you'll get TARGET = 0, so no solution.

So far the logic or mathematics. How you code this I'll leave up to you.