Overcomplicated oracle jdbc BLOB handling

44,393

Solution 1

The update approach you mention in the first case can be rewritten using pure JDBC code and thus reduce your dependency on Oracle-specific classes. This could be helpful if your app needs to be database agnostic.

public static void updateBlobColumn(Connection con, String table, String blobColumn, byte[] inputBytes, String idColumn, Long id) throws SQLException {
  PreparedStatement pStmt = null;
  ResultSet rs = null;
  try {
    String sql = 
      " SELECT " + blobColumn + 
      " FROM " + table + 
      " WHERE " + idColumn + " = ? " +
      " FOR UPDATE";
    pStmt = con.prepareStatement(sql, 
      ResultSet.TYPE_FORWARD_ONLY, 
      ResultSet.CONCUR_UPDATABLE);
    pStmt.setLong(1, id);
    rs = pStmt.executeQuery();
    if (rs.next()) {
      Blob blob = rs.getBlob(blobColumn);
      blob.truncate(0);
      blob.setBytes(1, inputBytes);
      rs.updateBlob(blobColumn, blob);
      rs.updateRow();
    }
  }
  finally {
    if(rs != null) rs.close();
    if(pStmt != null) pStmt.close();
  }
}

For MSSQL I understand that the locking syntax is different:

String sql = 
  " SELECT " + blobColumn + 
  " FROM " + table + " WITH (rowlock, updlock) " + 
  " WHERE " + idColumn + " = ? "

Solution 2

Another point of view from Oracle DBA. Sun guys did very poor job when they designed JDBC standards(1.0, 2.0, 3.0, 4.0). BLOB stands for large object and therefore it can be very large. It is something that can not be stored in JVM heap. Oracle thinks of BLOBs as something like file handles(it fact they are call then "lob locators"). LOBS can not be created via constructor and are not Java objects. Also LOB locators(oracle.sql.BLOB) can not be created via constructor - they MUST be created in the DB side. In Oracle there are two ways how to create a LOB.

  1. DBMS_LOB.CREATETEMPORATY - the returned locator in this case points into temporary tablespace. All the writes/reads against this locator will be sent via network onto DB server. Nothing is stored in JVM heap.

  2. Call to EMPTY_BLOB function. INSERT INTO T1(NAME, FILE) VALUES("a.avi", EMPTY_BLOB()) RETURNING FILE INTO ?; In this case returned lob locator points into data tablespace. All the writes/reads against this locator will be sent via network onto DB server. All the writes are "guarded" by writes into redo-logs. Nothing is stored in JVM heap. The returning clause was not supported by JDBC standards (1.0, 2.0), therefore you can find many examples on the internet where people recommend approach of two steps: "INSERT...; SELECT ... FOR UPDATE;"

Oracle lobs must be associated with some database connection, they can not be used when DB connection is lost/closed/(or "commited"). They can not be passed from one connection to another.

You second example can work, but will require excessive copying if data from temporary tablespace into data tablespace.

Solution 3

The Oracle server's LOB handling is pretty poor and can suffer from serious performance problems (e.g. massive overuse of the redo log), so the first solution may be a way to address those.

I would suggest trying both approaches. if you have a competent DBA, they may be able to advise which approach has the lowest impact on the server.

Solution 4

One interesting thing with JDBC is you can upgrade rather aggressively to the latest drivers and work with JDBC 4.0 features. The oracle JDBC drivers will work with older database versions, so you can use an 11g branded JDBC driver against a 10g database. The Oracle database 11g JDBC comes in two flavors: ojdbc5.jar for Java 5 (i.e., JDK 1.5) and ojdbc6.jar for Java 6 (i.e., JDK 1.6). The ojdbc6.jar supports the new JDBC 4.0 specification.

With the newer drivers/jdbc 4.0 you can create Blobs and Clobs off the connection object:

Blob aBlob = con.createBlob();
int numWritten = aBlob.setBytes(1, val);

Solution 5

This statement :

blob.setBytes(1, inputBytes);

is giving issues when I use oracle thin client ojdbc14.jar, "Unsupported Features"

So, I had to work around by :

rset.next();
Blob bobj = rset.getBlob(1);
BLOB object = (BLOB) bobj;
int chunkSize = object.getChunkSize();
byte[] binaryBuffer = new byte[chunkSize];
int position = 1;
int bytesRead = 0;
int bytesWritten = 0, totbytesRead = 0, totbytesWritten = 0;
InputStream is = fileItem.getInputStream();
while ((bytesRead = is.read(binaryBuffer)) != -1) {
bytesWritten = object.putBytes(position, binaryBuffer, bytesRead);
position += bytesRead;
totbytesRead += bytesRead;
totbytesWritten += bytesWritten;
is.close();
Share:
44,393
Raymond Lukanta
Author by

Raymond Lukanta

Updated on August 20, 2020

Comments

  • Raymond Lukanta
    Raymond Lukanta over 3 years

    When I search the web for inserting BLOBs into Oracle database with jdbc thin driver, most of the webpages suggest a 3-step approach:

    1. insert empty_blob() value.
    2. select the row with for update.
    3. insert the real value.

    This works fine for me, here is an example:

    Connection oracleConnection = ...
    
    byte[] testArray = ...
    
    PreparedStatement ps = oracleConnection.prepareStatement(
        "insert into test (id, blobfield) values(?, empty_blob())");
    ps.setInt(1, 100);
    ps.executeUpdate();
    ps.close();
    ps = oracleConnection.prepareStatement(
        "select blobfield from test where id = ? for update");
    ps.setInt(1, 100);
    OracleResultSet rs = (OracleResultSet) ps.executeQuery();
    if (rs.next()) {
        BLOB blob = (BLOB) rs.getBLOB(1);
        OutputStream outputStream = blob.setBinaryStream(0L);
        InputStream inputStream = new ByteArrayInputStream(testArray);
        byte[] buffer = new byte[blob.getBufferSize()];
        int byteread = 0;
        while ((byteread = inputStream.read(buffer)) != -1) {
            outputStream.write(buffer, 0, byteread);
        }
        outputStream.close();
        inputStream.close();
    }
    

    There are some webpages where the authors suggest using a simpler 1-step solution. Previous example with this solution:

    Connection oracleConnection = ...
    
    byte[] testArray = ...
    
    PreparedStatement ps = oracleConnection.prepareStatement(
        "insert into test(id, blobfield) values(?, ?)");
    BLOB blob = BLOB.createTemporary(oracleConnection, false, BLOB.DURATION_SESSION);
    OutputStream outputStream = blob.setBinaryStream(0L);
    InputStream inputStream = new ByteArrayInputStream(testArray);
    byte[] buffer = new byte[blob.getBufferSize()];
    int byteread = 0;
    while ((byteread = inputStream.read(buffer)) != -1) {
        outputStream.write(buffer, 0, byteread);
    }
    outputStream.close();
    inputStream.close();
    
    ps.setInt(1, 100);
    ps.setBlob(2, blob);
    ps.executeUpdate();
    ps.close();
    

    The second code is much more easier, so my question is: What is the point of first (popular) solution? Is there (was there) some kind of constraint for the second solution (Oracle server version number, jdbc driver version, size of the blob,...)? Is the first solution better (speed, memory consumption,...)? Any reasons for not using the simpler second approach?

    The exact same question applies for CLOB fields.