Regex to match four repeated letters in a string using a Java pattern

32,696

Solution 1

Not knowing about the finite repetition syntax, your own problem solving skill should lead you to this:

([a-z])\1\1\1

Obviously it's not pretty, but:

  • It works
  • It exercises your own problem solving skill
  • It may lead you to deeper understanding of concepts
    • In this case, knowing the desugared form of the finite repetition syntax

I have a concern:

  • "ffffffff".matches("([a-z])\\1{3,}") = true
  • "fffffasdf".matches("([a-z])\\1{3,}") = false
  • "asdffffffasdf".matches("([a-z])\\1{3,}") = false

What can I do for the bottom two?

The problem is that in Java, matches need to match the whole string; it is as if the pattern is surrounded by ^ and $.

Unfortunately there is no String.containsPattern(String regex), but you can always use this trick of surrounding the pattern with .*:

"asdfffffffffasf".matches(".*([a-z])\\1{3,}.*") // true!
//                         ^^              ^^

Solution 2

You want to match a single character and then that character repeated three more times:

([a-z])\1{3}

Note: In Java you need to escape the backslashes inside your regular expressions.


Update: The reason why it isn't doing what you want is because you are using the method matches which requires that the string exactly matches the regular expression, not just that it contains the regular expression. To check for containment you should instead use the Matcher class. Here is some example code:

import java.util.regex.Pattern;
import java.util.regex.Matcher;

class Program
{
    public static void main(String[] args)
    {
        Pattern pattern = Pattern.compile("([a-z])\\1{3}");
        Matcher matcher = pattern.matcher("asdffffffasdf");
        System.out.println(matcher.find());
    }
}

Result:

true

Solution 3

You can put {n} after something to match it n times, so:

([a-z])\1{3}

Solution 4

General regex pattern for predefinite repetition is {4}.

Thus here ([a-z])\1{3} should match your 4 chars.

Share:
32,696
Anonymous user
Author by

Anonymous user

Updated on July 09, 2022

Comments

  • Anonymous user
    Anonymous user almost 2 years

    I want to match something like aaaa, aaaad, adjjjjk. Something like ([a-z])\1+ was used to match the repeated characters, but I am not able to figure this out for four letters.

  • seinecle
    seinecle about 11 years
    If you are trying to match a string where the string you look for is just before an end of line character, the pattern above won't match. (I learnt that the hard way!). You could do: '"asdfffffffffasf".matches(".*([a-z])\\1{3,}(?s).*")' where '(?s)' triggers the matching for end of line chars.
  • Brownsoo Han
    Brownsoo Han about 7 years
    it is best to find character repeating.
  • Parth Bhoiwala
    Parth Bhoiwala about 6 years
    I have been looking for this since last 2 hours. Thank you very much