Golang: Issues replacing newlines in a string from a text file

32,128

Solution 1

I guess that you are running the code using Windows. Observe that if you print out the length of the resulting string, it will show something over 100 characters. The reason is that Windows uses not only newlines (\n) but also carriage returns (\r) - so a newline in Windows is actually \r\n, not \n. To properly filter them out of your string, use:

re := regexp.MustCompile(`\r?\n`)
input = re.ReplaceAllString(input, " ")

The backticks will make sure that you don't need to quote the backslashes in the regular expression. I used the question mark for the carriage return to make sure that your code works on other platforms as well.

Solution 2

I do not think that you need to use regex for such an easy task. This can be achieved with just

absPath, _ := filepath.Abs("../Go/input.txt")
data, _ := ioutil.ReadFile(absPath)
input := string(data)
strings.Replace(input, "\n","",-1)

example of removing \n

Share:
32,128
Orange Receptacle
Author by

Orange Receptacle

Updated on July 23, 2022

Comments

  • Orange Receptacle
    Orange Receptacle almost 2 years

    I've been trying to have a File be read, which will then put the read material into a string. Then the string will get split by line into multiple strings:

    absPath, _ := filepath.Abs("../Go/input.txt")
    data, err := ioutil.ReadFile(absPath)
    if err != nil {
        panic(err)
    }
    input := string(data)
    

    The input.txt is read as:

    a

    strong little bird

    with a very

    big heart

    went

    to school one day and

    forgot his food at

    home

    However,

    re = regexp.MustCompile("\\n")
    input = re.ReplaceAllString(input, " ")
    

    turns the text into a mangled mess of:

    homeot his food atand

    I'm not sure how replacing newlines can mess up so badly to the point where the text inverts itself

  • madsen
    madsen over 8 years
    That wouldn't solve the issue though, since — as Jens mentioned — Windows newlines (\r\n) are causing the described effect when only the \n is replaced while leaving the \r in the string.
  • madsen
    madsen over 8 years
    No reason to get aggressive. Your point about not necessarily needing regex is fine, but your answer doesn't help OP solve his actual problem.
  • madsen
    madsen over 8 years
    If it wasn't for Jens' answer, OP wouldn't know that the \r was the problem (and OP's output is more than enough to know that that is in fact the case). Your answer just provides OP with a different way of getting where he already is. It's not about inability to modify a working solution, it's about the fact that your solution is ... well ... not a solution to OP's problem.
  • Salvador Dali
    Salvador Dali over 8 years
    @madsen, you are basing your justification on the fact, that Jen's answer solved OP's problem. Up till now I see that 2 hours passed and OP has not told anything. Which might mean that Jen's answer has not fixed anything.
  • madsen
    madsen over 8 years
    You got to be kidding. If you know what \r does, then you know that OP's output is caused by exactly that — not by whether or not he's using regexes. Your answer only works if you apply Jens' fix too, otherwise it won't make any difference at all.