开发者

Java file hash for identifying identical files

开发者 https://www.devze.com 2023-03-02 04:13 出处:网络
I would like to get the hash of files (mostly video files) independent of external properties such as开发者_开发知识库 path and file name. I\'ll be needing to store hash in a database and compare file

I would like to get the hash of files (mostly video files) independent of external properties such as开发者_开发知识库 path and file name. I'll be needing to store hash in a database and compare file hash to find identical files.


Have a look at the DigestInputStream class: http://docs.oracle.com/javase/7/docs/api/java/security/DigestInputStream.html


public byte[] digestFile( File f ){
  try {
    MessageDigest messageDigest = MessageDigest.getInstance("SHA-1");
    FileInputStream fis = new FileInputStream( f );
    byte[] buffer = new byte[1024];
    int read = -1;
    while ((read = fis.read(buffer)) != -1) {
      messageDigest.digest(buffer, 0, read);
    }
    return messageDigest.digest();
  } catch (VariousExceptions e) {
    //handle
  }
}


Depending on what you need, you can do this pretty easily using Guava's Files and ByteStreams classes:

byte[] digest = Files.getDigest(file, MessageDigest.getInstance("SHA"));
0

精彩评论

暂无评论...
验证码 换一张
取 消