Reading binary istream byte by byte

45,700

Solution 1

The >> extractors are for formatted input; they skip white space (by default). For single character unformatted input, you can use istream::get() (returns an int, either EOF if the read fails, or a value in the range [0,UCHAR_MAX]) or istream::get(char&) (puts the character read in the argument, returns something which converts to bool, true if the read succeeds, and false if it fails.

Solution 2

there is a read() member function in which you can specify the number of bytes.

Solution 3

Why are you using formatted extraction, rather than .read()?

Solution 4

source.get()

will give you a single byte. It is unformatted input function. operator>> is formatted input function that may imply skipping whitespace characters.

Solution 5

As others mentioned, you should use istream::read(). But, if you must use formatted extraction, consider std::noskipws.

Share:
45,700
Adrian McCarthy
Author by

Adrian McCarthy

I'm a life-long programmer with a variety of interests, including ray tracing, text processing. I've written a murder mystery about a software engineer who discovers that solving a crime is a lot like debugging a program. Check out Blue Screen of Death at your favorite ebook retailer. A second novel, Access Violation, is in the works. SOreadytohelp

Updated on August 29, 2020

Comments

  • Adrian McCarthy
    Adrian McCarthy over 3 years

    I was attempting to read a binary file byte by byte using an ifstream. I've used istream methods like get() before to read entire chunks of a binary file at once without a problem. But my current task lends itself to going byte by byte and relying on the buffering in the io-system to make it efficient. The problem is that I seemed to reach the end of the file several bytes sooner than I should. So I wrote the following test program:

    #include <iostream>
    #include <fstream>
    
    int main() {
        typedef unsigned char uint8;
        std::ifstream source("test.dat", std::ios_base::binary);
        while (source) {
            std::ios::pos_type before = source.tellg();
            uint8 x;
            source >> x;
            std::ios::pos_type after = source.tellg();
            std::cout << before << ' ' << static_cast<int>(x) << ' '
                      << after << std::endl;
        }
        return 0;
    }
    

    This dumps the contents of test.dat, one byte per line, showing the file position before and after.

    Sure enough, if my file happens to have the two-byte sequence 0x0D-0x0A (which corresponds to carriage return and line feed), those bytes are skipped.

    • I've opened the stream in binary mode. Shouldn't that prevent it from interpreting line separators?
    • Do extraction operators always use text mode?
    • What's the right way to read byte by byte from a binary istream?

    MSVC++ 2008 on Windows.