22.1 The Interface java.io.DataInput

The DataInput interface provides for reading bytes from a binary stream and reconstructing from them data in any of the Java primitive types. There is also a facility for reconstructing a String from data in Java modified UTF-8 format.

The DataOutput interface (§22.2) supports the creation of binary output data suitable for reading back in through the DataInput interface.

The DataInput interface is implemented by classes DataInputStream (§22.11) and RandomAccessFile (§22.23).

public interface DataInput {
	public void readFully(byte[] b)
		throws IOException, NullPointerException;
	public void readFully(byte[] b, int off, int len)
		throws IOException, NullPointerException,
			IndexOutOfBoundsException;
	public int skipBytes(int n) throws IOException;
	public boolean readBoolean() throws IOException;
	public byte readByte() throws IOException;
	public int readUnsignedByte() throws IOException;
	public short readShort() throws IOException;
	public int readUnsignedShort() throws IOException;
	public char readChar() throws IOException;
	public int readInt() throws IOException;
	public long readLong() throws IOException;
	public float readFloat() throws IOException;
	public double readDouble() throws IOException;
	public String readLine() throws IOException;
	public String readUTF() throws IOException;
	public final static String readUTF(DataInput in)
		throws IOException;
}

It is generally true of all the reading routines in this interface that if end of file is reached before the desired number of bytes has been read, an EOFException (which is a kind of IOException) is thrown. If any byte cannot be read for any reason other than end of file, an IOException other than EOFException is thrown. In particular, an IOException may be thrown if the input stream has been closed (§22.3.6).

22.1.1 public void readFully(byte[] b)
throws IOException, NullPointerException;

The general contract of readFully(b) is that it reads some bytes from an input stream and stores them into the buffer array b. The number of bytes read is equal to the length of b.

This method blocks until one of the following conditions occurs:

If b is null, a NullPointerException is thrown.

If b.length is zero, then no bytes are read. Otherwise, the first byte read is stored into element b[0], the next one into b[1], and so on.

If an exception is thrown from this method, then it may be that some but not all bytes of b have been updated with data from the input stream.

22.1.2 public void readFully(byte[] b, int off, int len)
throws IOException, NullPointerException,
IndexOutOfBoundsException

The general contract of readFully(b, off, len) is that it reads len bytes from an input stream.

This method blocks until one of the following conditions occurs:

If b is null, a NullPointerException is thrown.

If off is negative, or len is negative, or off+len is greater than the length of the array b, then an IndexOutOfBoundsException is thrown.

If len is zero, then no bytes are read. Otherwise, the first byte read is stored into element b[off], the next one into b[off+1], and so on. The number of bytes read is, at most, equal to len.

If an exception is thrown from this method, then it may be that some but not all bytes of b in positions off through off+len-1 have been updated with data from the input stream.

22.1.3 public int skipBytes(int n) throws IOException

The general contract of skipBytes is that it makes an attempt to skip over n bytes of data from the input stream, discarding the skipped bytes. However, it may skip over some smaller number of bytes, possibly zero. This may result from any of a number of conditions; reaching end of file before n bytes have been skipped is only one possibility. This method never throws an EOFException. The actual number of bytes skipped is returned.

22.1.4 public boolean readBoolean() throws IOException;

The general contract of readBoolean is that it reads one input byte and returns true if that byte is nonzero, false if that byte is zero.

This method is suitable for reading the byte written by the writeBoolean method of interface DataOutput (§22.2.4).

22.1.5 public byte readByte() throws IOException

The general contract of readByte is that it reads and returns one input byte. The byte is treated as a signed value in the range -128 through 127, inclusive.

This method is suitable for reading the byte written by the writeByte method of interface DataOutput (§22.2.5).

22.1.6 public int readUnsignedByte() throws IOException

The general contract of readUnsignedByte is that it reads one input byte, zero- extends it to type int, and returns the result, which is therefore in the range 0 through 255.

This method is suitable for reading the byte written by the writeByte method of interface DataOutput (§22.2.5) if the argument to writeByte was intended to be a value in the range 0 through 255.

22.1.7 public short readShort() throws IOException

The general contract of readShort is that it reads two input bytes and returns a short value. Let a be the first byte read and b be the second byte. The value returned is:

(short)((a << 8) | (b & 0xff))

This method is suitable for reading the bytes written by the writeShort method of interface DataOutput (§22.2.6).

22.1.8 public int readUnsignedShort() throws IOException

The general contract of readUnsignedShort is that it reads two input bytes and returns an int value in the range 0 through 65535. Let a be the first byte read and b be the second byte. The value returned is:

(((a & 0xff) << 8) | (b & 0xff))

This method is suitable for reading the bytes written by the writeShort method of interface DataOutput (§22.2.6) if the argument to writeShort was intended to be a value in the range 0 through 65535.

22.1.9 public char readChar() throws IOException

The general contract of readChar is that it reads two input bytes and returns a char value. Let a be the first byte read and b be the second byte. The value returned is:

(char)((a << 8) | (b & 0xff))

This method is suitable for reading bytes written by the writeChar method of interface DataOutput (§22.2.7).

22.1.10 public int readInt() throws IOException

The general contract of readInt is that it reads four input bytes and returns an int value. Let a be the first byte read, b be the second byte, c be the third byte, and d be the fourth byte. The value returned is:


(((a & 0xff) << 24) | ((b & 0xff) << 16) |
  ((c & 0xff) <<    8) | (d & 0xff))

This method is suitable for reading bytes written by the writeInt method of interface DataOutput (§22.2.8).

22.1.11 public long readLong() throws IOException

The general contract of readLong is that it reads eight input bytes and returns a long value. Let a be the first byte read, b be the second byte, c be the third byte, d be the fourth byte, e be the fifth byte, f be the sixth byte, g be the seventh byte, and h be the eighth byte. The value returned is:


(((long)(a & 0xff) << 56) |
  ((long)(b & 0xff) << 48) |
  ((long)(c & 0xff) <<  40) |
  ((long)(d & 0xff) << 32) |
  ((long)(e & 0xff) <<  24) |
  ((long)(f & 0xff) << 16) |
  ((long)(g & 0xff) <<    8) |
  ((long)(h & 0xff)))

This method is suitable for reading bytes written by the writeLong method of interface DataOutput (§22.2.9).

22.1.12 public float readFloat() throws IOException

The general contract of readFloat is that it reads four input bytes and returns a float value. It does this by first constructing an int value in exactly the manner of the readInt method (§22.1.10), then converting this int value to a float in exactly the manner of the method Float.intBitsToFloat (§20.9.23).

This method is suitable for reading bytes written by the writeFloat method of interface DataOutput (§22.2.10).

22.1.13 public double readDouble() throws IOException

The general contract of readDouble is that it reads eight input bytes and returns a double value. It does this by first constructing a long value in exactly the manner of the readlong method (§22.1.11), then converting this long value to a double in exactly the manner of the method Double.longBitsToDouble (§20.10.22).

This method is suitable for reading bytes written by the writeDouble method of interface DataOutput (§22.2.11).

22.1.14 public String readLine() throws IOException

The general contract of readLine is that it reads successive bytes, converting each byte separately into a character, until it encounters a line terminator or end of file; the characters read are then returned as a String. Note that because this method processes bytes, it does not support input of the full Unicode character set.

If end of file is encountered before even one byte can be read, then null is returned. Otherwise, each byte that is read is converted to type char by zero-extension. If the character '\n' is encountered, it is discarded and reading ceases. If the character '\r' is encountered, it is discarded and, if the following byte converts to the character '\n', then that is discarded also; reading then ceases. If end of file is encountered before either of the characters '\n' and '\r' is encountered, reading ceases. Once reading has ceased, a String is returned that contains all the characters read and not discarded, taken in order. Note that every character in this string will have a value less than \u0100, that is, (char)256.

22.1.15 public String readUTF() throws IOException

The general contract of readUTF is that it reads a representation of a Unicode character string encoded in Java modified UTF-8 format; this string of characters is then returned as a String.

First, two bytes are read and used to construct an unsigned 16-bit integer in exactly the manner of the readUnsignedShort method (§22.1.8). This integer value is called the UTF length and specifies the number of additional bytes to be read. These bytes are then converted to characters by considering them in groups. The length of each group is computed from the value of the first byte of the group. The byte following a group, if any, is the first byte of the next group.

If the first byte of a group matches the bit pattern 0xxxxxxx (where x means "may be 0 or 1"), then the group consists of just that byte. The byte is zero-extended to form a character.

If the first byte of a group matches the bit pattern 110xxxxx, then the group consists of that byte a and a second byte b. If there is no byte b (because byte a was the last of the bytes to be read), or if byte b does not match the bit pattern 10xxxxxx, then a UTFDataFormatException is thrown. Otherwise, the group is converted to the character:

(char)(((a & 0x1F) << 6) | (b & 0x3F))

If the first byte of a group matches the bit pattern 1110xxxx, then the group consists of that byte a and two more bytes b and c. If there is no byte c (because byte a was one of the last two of the bytes to be read), or either byte b or byte c does not match the bit pattern 10xxxxxx, then a UTFDataFormatException is thrown. Otherwise, the group is converted to the character:

(char)(((a & 0x0F) << 12) | ((b & 0x3F) << 6) | (c & 0x3F))

If the first byte of a group matches the pattern 1111xxxx or the pattern 10xxxxxx, then a UTFDataFormatException is thrown.

If end of file is encountered at any time during this entire process, then an EOFException is thrown.

After every group has been converted to a character by this process, the characters are gathered, in the same order in which their corresponding groups were read from the input stream, to form a String, which is returned.

The writeUTF method of interface DataOutput (§22.2.14) may be used to write data that is suitable for reading by this method.