How to split strings into characters in Scala

53,784

Solution 1

You can use toList as follows:

scala> s.toList         
res1: List[Char] = List(T, e, s, t)

If you want an array, you can use toArray

scala> s.toArray
res2: Array[Char] = Array(T, e, s, t)

Solution 2

Do you need characters?

"Test".toList    // Makes a list of characters
"Test".toArray   // Makes an array of characters

Do you need bytes?

"Test".getBytes  // Java provides this

Do you need strings?

"Test".map(_.toString)    // Vector of strings
"Test".sliding(1).toList  // List of strings
"Test".sliding(1).toArray // Array of strings

Do you need UTF-32 code points? Okay, that's a tougher one.

def UTF32point(s: String, idx: Int = 0, found: List[Int] = Nil): List[Int] = {
  if (idx >= s.length) found.reverse
  else {
    val point = s.codePointAt(idx)
    UTF32point(s, idx + java.lang.Character.charCount(point), point :: found)
  }
}
UTF32point("Test")

Solution 3

Actually you don't need to do anything special. There is already implicit conversion in Predef to WrappedString and WrappedString extends IndexedSeq[Char] so you have all goodies that available in it, like:

"Test" foreach println
"Test" map (_ + "!") 

Edit

Predef has augmentString conversion that has higher priority than wrapString in LowPriorityImplicits. So String end up being StringLike[String], that is also Seq of chars.

Solution 4

Additionally, it should be noted that if what you actually want isn't an actual list object, but simply to do something which each character, then Strings can be used as iterable collections of characters in Scala

for(ch<-"Test") println("_" + ch + "_") //prints each letter on a different line, surrounded by underscores
Share:
53,784
sam
Author by

sam

Updated on July 08, 2022

Comments

  • sam
    sam almost 2 years

    For example, there is a string val s = "Test". How do you separate it into t, e, s, t?

  • Kevin Wright
    Kevin Wright about 13 years
    I think it goes via StringOps actually.
  • tenshi
    tenshi about 13 years
    @Kevin Wright: You are right. Predef has augmentString conversion that has higher priority than wrapString in LowPriorityImplicits. Sorry for this, I don't noticed it. Thanks!
  • Kevin Wright
    Kevin Wright about 13 years
    For even being aware of code points, you deserve to have the accepted answer here. Though I doubt the OP will appreciate the subtlety.
  • ldmtwo
    ldmtwo almost 10 years
    For those seeing this, toCharArray should always be preferred when possible. toList is painfully slow. Each character primitive becomes a Character object. Each link in the list is an object as well. 2 bytes becomes 12+2+12+2=28 bytes. We can no longer have quick and random access. However, if you are just playing around or writing Hello World, then have at it, but don't expect it to scale well.
  • zyxue
    zyxue over 7 years
    I am new to Scala. Why does "abc".toList work but "abc".toList() doesn't work. Where can I find documentation for scala String.
  • zyxue
    zyxue over 7 years
    Found the answer here, stackoverflow.com/questions/6643030/…, never mind.