Java string split with "." (dot)

550,351

Solution 1

You need to escape the dot if you want to split on a literal dot:

String extensionRemoved = filename.split("\\.")[0];

Otherwise you are splitting on the regex ., which means "any character".
Note the double backslash needed to create a single backslash in the regex.


You're getting an ArrayIndexOutOfBoundsException because your input string is just a dot, ie ".", which is an edge case that produces an empty array when split on dot; split(regex) removes all trailing blanks from the result, but since splitting a dot on a dot leaves only two blanks, after trailing blanks are removed you're left with an empty array.

To avoid getting an ArrayIndexOutOfBoundsException for this edge case, use the overloaded version of split(regex, limit), which has a second parameter that is the size limit for the resulting array. When limit is negative, the behaviour of removing trailing blanks from the resulting array is disabled:

".".split("\\.", -1) // returns an array of two blanks, ie ["", ""]

ie, when filename is just a dot ".", calling filename.split("\\.", -1)[0] will return a blank, but calling filename.split("\\.")[0] will throw an ArrayIndexOutOfBoundsException.

Solution 2

The dot "." is a special character in java regex engine, so you have to use "\\." to escape this character:

final String extensionRemoved = filename.split("\\.")[0];

Solution 3

This is because . is a reserved character in regular expression, representing any character. Instead, we should use the following statement:

String extensionRemoved = filename.split("\\.")[0];

Solution 4

I believe you should escape the dot. Try:

String filename = "D:/some folder/001.docx";
String extensionRemoved = filename.split("\\.")[0];

Otherwise dot is interpreted as any character in regular expressions.

Share:
550,351
Ali Ismayilov
Author by

Ali Ismayilov

Updated on December 03, 2021

Comments

  • Ali Ismayilov
    Ali Ismayilov over 2 years

    Why does the second line of this code throw ArrayIndexOutOfBoundsException?

    String filename = "D:/some folder/001.docx";
    String extensionRemoved = filename.split(".")[0];
    

    While this works:

    String driveLetter = filename.split("/")[0];
    

    I use Java 7.

    • Felix Glas
      Felix Glas over 11 years
      Doesn't split use a regex string? In that case "." means any character.
    • Ben
      Ben over 10 years
      ...and it's a DOUBLE backslash to delimit.
  • Nic
    Nic about 8 years
    It is not a special character in Java. It's a special character in Java's regex engine.
  • aimhaj
    aimhaj about 8 years
    I just wasn't very accurate in my response but I agree with you. thanks for the precision ;)
  • Nic
    Nic about 8 years
    It's a distinction worth making. Also, I just realized that I messed up a bit myself; it is a special char in Java, but that's not why it's causing a problem here. Anyway.
  • saurabheights
    saurabheights almost 7 years
    Note that filename can contain multiple dots. One must use the last index of "." and use that to find the substring of the filename.
  • Bohemian
    Bohemian almost 7 years
    @saurabheights The question was not about a correct regex, but rather why there was a an ArrayIndexOutOfBoundsException. That said, you are incorrect: You don't need to know where the last dot is; you just need the right regex: filename.split("\\.(?=[^.]*$)"). This uses a look ahead to assert there are no dots anywhere in the input that follows the matching dot.
  • Bohemian
    Bohemian over 4 years
    @emma you can delete them yourself via the “delete” link just beneath the question