Why is a 3-way merge advantageous over a 2-way merge?

80,185

Solution 1

Say you and your friend both checked out a file, and made some changes to it. You removed a line at the beginning, and your friend added a line at the end. Then he committed his file, and you need to merge his changes into your copy.

If you were doing a two-way merge (in other words, a diff), the tool could compare the two files, and see that the first and last lines are different. But how would it know what to do with the differences? Should the merged version include the first line? Should it include the last line?

With a three-way merge, it can compare the two files, but it can also compare each of them against the original copy (before either of you changed it). So it can see that you removed the first line, and that your friend added the last line. And it can use that information to produce the merged version.

Solution 2

This slide from a perforce presentation is interesting:

slide image

The essential logic of a three-way merge tool is simple:

  • Compare base, source, and target files
  • Identify the "chunks" in the source and target files file:
    • Chunks that don't match the base
    • Chunks that do match the base
  • Then, put together a merged result consisting of:
    • The chunks that match one another in all 3 files
    • The chunks that don't match the base in either the source or in the target but not in both
    • The chunks that don't match the base but that do match each other (i.e., they've been changed the same way in both the source and the target)
    • Placeholders for the chunks that conflict, to be resolved by the user.

Note that the "chunks" in this illustration are purely symbolic. Each could represent lines in a file, or nodes in a hierarchy, or even files in a directory. It all depends on what a particular merge tool is capable of.

You may be asking what advantage a 3-way merge offers over a 2-way merge. Actually, there is no such thing as a two-way merge, only tools that diff two files and allow you to "merge" by picking chunks from one file or the other.
Only a 3-way merge gives you the ability to know whether or not a chunk is a change from the origin and whether or not changes conflict.

Solution 3

A three-way merge is where two changesets to one base file are merged as they are applied, as opposed to applying one, then merging the result with the other.

For example, having two changes where a line is added in the same place could be interpreted as two additions, not a change of one line.

For example, file a has been modified by two people, one adding moose, one adding mouse.

#File a
    dog
    cat

#diff b, a
    dog
+++ mouse
    cat

#diff c, a
    dog
+++ moose
    cat

Now, if we merge the changesets as we apply them, we will get (3-way merge)

#diff b and c, a
    dog
+++ mouse
+++ moose
    cat

But if we apply b, then look at the change from b to c it will look like we are just changing a 'u' to an 'o' (2-way merge)

    #diff b, c
    dog
--- mouse
+++ moose
    cat

Solution 4

enter image description here

Borrowed from AWS CodeCommit: Developer Tools > CodeCommit > Repositories > RepositoryName > Pull requests > Pull request name > Merge

Share:
80,185

Related videos on Youtube

Johannes Bittner
Author by

Johannes Bittner

Updated on December 27, 2021

Comments

  • Johannes Bittner
    Johannes Bittner over 2 years

    Wikipedia says a 3-way merge is less error-prone than a 2-way merge, and often times doesn't need user intervention. Why is this the case?

    An example where a 3-way merge succeeds and a 2-way merge fails would be helpful.

  • Vlad
    Vlad about 10 years
    "whether or not changes conflict." - doesn't 2-way merge (diff) shows a conflict as well (though the info is lost as of the source of the conflict)/
  • Wernight
    Wernight over 9 years
    It's however common in Git to have a 4-way merge where the base is actually not the same. Still a 3-way merge is better and 2-way.
  • Pacerier
    Pacerier about 9 years
    @Wernight, Is there a 5-way merge?
  • Wernight
    Wernight about 9 years
    @Pacerier Not that I know of, but that's what is actually happening during a git cherry-pick, or rebase.
  • Kaneg
    Kaneg over 7 years
    Very detailed and useful explanation
  • VonC
    VonC about 4 years
    @Qwerty This is more about configuring the merge conflict visualization. The merge itself is three-way already with Git: stackoverflow.com/a/47115936/6309
  • Qwerty
    Qwerty about 4 years
    My git was doing two way merges without having it set explicitly as in the linked answer. An I thought that when someone reads this, they might be interested in knowing how to enable it right away. EDIT: Okay I understand, of course the merge was always three way. I will delete and repost the comment, since I can't edit it now, so can you.
  • Qwerty
    Qwerty about 4 years
    How to configure git to see 3-way merge diffs in diff tool stackoverflow.com/a/18131595/985454
  • Harry
    Harry about 4 years
    "But how would it know what to do with the differences?" Didn't get it. If it can already see the differences between the two files (without reference to the original), why can't it apply both changes serially in increasing order of files' timestamps? That is: It starts off with my friend's committed copy taking it to be the (new) original (with the line addition at the top) and then, on top of it, applies my local changes (line deletion at the botton).
  • JW.
    JW. about 4 years
    @Harry Say the original had three lines (ABC). It starts with my friend's copy (ABCD), and compares it to mine (BC). Without seeing the original, it might think that I removed both A and D, and that the final result should be BC.
  • Jake Stevens-Haas
    Jake Stevens-Haas about 3 years
    @Harry if each file had a list of timestamped changes since the common ancestor, you would have a 3-way merge. The method you described would require rewinding the file back to the common ancestor in order to apply the diffs chronologically. Phrased differently, I'm not sure there's an unambiguous meaning to "a timestamped diff between two files without reference to a common ancestor."
  • JP Zhang
    JP Zhang over 2 years
    In the picture, which line refers to the source branch, and which one refers to the designation branch?