Determine whether the two classes are linearly separable (algorithmically in 2D)

algorithm math machine-learning classification

10,246

Solution 1

If you found the convex hull for both the X points and the O points separately (i.e. you have two separate convex hulls at this stage) you would then just need to check whether any segments of the hulls intersected or whether either hull was enclosed by the other.

If the two hulls were found to be totally disjoint the two data-sets would be geometrically separable.

Since the hulls are convex by definition, any separator would be a straight line.

There are efficient algorithms that can be used both to find the convex hull (the qhull algorithm is based on an O(nlog(n)) quickhull approach I think), and to perform line-line intersection tests for a set of segments (sweepline at O(nlog(n))), so overall it seems that an efficient O(nlog(n)) algorithm should be possible.

This type of approach should also generalise to general k-way separation tests (where you have k groups of objects) by forming the convex hull and performing the intersection tests for each group.

It should also work in higher dimensions, although the intersection tests would start to become more challenging...

Hope this helps.

Solution 2

Computationally the most effective way to decide whether two sets of points are linearly separable is by applying linear programming. GLTK is perfect for that purpose and pretty much every highlevel language offers an interface for it - R, Python, Octave, Julia, etc.

Let's say you have a set of points A and B:

enter image description here

Then you have to minimize the 0 for the following conditions:

(The A below is a matrix, not the set of points from above)

enter image description here

"Minimizing 0" effectively means that you don't need to actually optimize an objective function because this is not necessary to find out if the sets are linearly separable.

In the end ( enter image description here ) is defining the separating plane.

enter image description here

In case you are interested in a working example in R or the math details, then check this out.

Solution 3

Here is a naïve algorithm that I'm quite sure will work (and, if so, shows that the problem is not NP-complete, as another post claims), but I wouldn't be surprised if it can be done more efficiently: If a separating line exists, it will be possible to move and rotate it until it hits two of the X'es or one X and one O. Therefore, we can simply look at all the possible lines that intersect two X'es or one X and one O, and see if any of them are dividing lines. So, for each of the O(n^2) pairs, iterate over all the n-2 other elements to see if all the X'es are on one side and all the O's on the other. Total time complexity: O(n^3).

Solution 4

Linear perceptron is guaranteed to find such separation if one exists.

See: http://en.wikipedia.org/wiki/Perceptron .

Solution 5

Computing a linear SVM then determining which side of the computed plane with optimal marginals each point lies on will tell you if the points are linearly separable.

This is overkill, but if you need a quick one off solution, there are many existing SVM libraries that will do this for you.

View more solutions

10,246

Håvard Geithus

We're hiring! LinkedIn: https://www.linkedin.com/in/havardge

Updated on June 28, 2022

Comments

Håvard Geithus almost 2 years

There are two classes, let's call them X and O. A number of elements belonging to these classes are spread out in the xy-plane. Here is an example where the two classes are not linearly separable. It is not possible to draw a straight line that perfectly divides the Xs and the Os on each side of the line.

How to determine, in general, whether the two classes are linearly separable?. I am interested in an algorithm where no assumptions are made regarding the number of elements or their distribution. An algorithm of the lowest computational complexity is of course preferred.
- tpg2114
  
  This might be better on math.stackexchange.com
Aasmund Eldhuset about 12 years

Maybe I'm missing something obvious, but can't this be solved in O(n^3) by checking every pair of X'es and, for each pair, check if the line between them is a separating line? (Edit: Need to check every pair of one X and one O too.)
Ben Jackson about 12 years

The Wikipedia entry links to references where it claims that even 2 clusters in Euclidean space is NP-hard.
Aasmund Eldhuset about 12 years

@BenJackson: K-means clustering is a different problem, in which the requirement is that each element should belong to the cluster whose mean value the element is closest to, and where there is only one type of elements. The OP's problem is to partition a collection of two types of elements into two clusters that can be separated by a line.
Ben Jackson about 12 years

@AasmundEldhuset: You are correct, I answered the question of classifying elements into two groups, while the question asks about pre-classified data. So my answer is wrong, but I'll leave it here since the discussion is informative.
Ben Jackson about 12 years

@AasmundEldhuset: However I think your answer is also wrong, since I can construct cases where checking lines between points in X finds all lines that bisect O even though a line exists which does not. I haven't constructed a case where it fails if you check both X and O though.
Ben Jackson about 12 years

Counterexample: 3 X in a small triangle, point down. 3 O widely spaced in a line below that. No line through any 2 X works. Now in that example if you instead search O it works, but I'm not sure if there's a further modification to invalidate that as well
Aasmund Eldhuset about 12 years

@BenJackson: Whoops, forgot to include the modification I made to my own comment to your post... Fixed now.
ldog about 12 years

This should do it. You can show that the optimal algorithm for taking convex hulls is O(n log n), since if it is not, then you can implement a sorting algorithm using the algorithm to compute convex hulls; and since you can not sort in faster than O(n log n), this implies you can not take convex hulls faster.
ldog about 12 years

I should note, in any discussion of SVM's the intersection of convex hulls are very pertinent and usually occupy the first few paragraphs or chapters.
rap music about 12 years

If one is implementing from scratch, it's easier to do O(n) point-in-polygon tests (O(log n)-time each) than line-line intersections: the hulls intersect if and only if they have vertices inside one another. This method is great for 2D but scales very poorly in d, the number of dimensions, because the convex hull can have size Omega(n^floor(d/2)).
rap music about 12 years

I take back the great comment. This problem can be formulated as an LP with two variables and solved in time O(n) by the algorithm of Megiddo and Dyer.
Darren Engwirda about 12 years

@rapmusic: No. It's possible to have intersecting polygons where no vertices are enclosed. For instance, it's easy to draw two intersecting triangles where none of the vertices fall inside the other triangle. In general the intersection tests are necessary.
Darren Engwirda about 12 years

@rapmusic: Could you provide a link to the LP method you've mentioned?
Alexandre C. over 8 years

This is not really overkill, since this is equivalent to the linear programming technique that some other answers suggest.