开发者

How to check if 2 large text files content are 100% same?

开发者 https://www.devze.com 2023-03-15 23:19 出处:网络
I have two large text files (about 1.8 GB each encoded in UTF-8). And I want to check if the content of both files are 100% same. If there is single character difference then it is considered not same

I have two large text files (about 1.8 GB each encoded in UTF-8). And I want to check if the content of both files are 100% same. If there is single character difference then it is considered not same, i.e. both files are different. How do I do that?

So my question is: How to check if 2 large text files content are 100% same?

Dim开发者_Python百科 TextFile1 as String = "C:\TextFile1.log"
Dim TextFile2 as String = "C:\TextFile2.log"

UPDATE:

I forgot to add, the solution should be using minimal memory because the specific PC only has 256 mb memory


Well, for starters you can compare their size (if they're not the same size they can't have the same content). Look at the FileInfo.Length property for this.

If they do have the same size, you can compare them byte by byte


I have run into a similar problem where I needed to identify duplicate pdf files that had the names changed. The best way to do this was by using md5 hash. I did a quick google search and it looks vb.net does have support built in funcationallity to do this. Check out http://www.vbknowledgebase.com/?Id=88&Desc=Find-Duplicate-files-using-Vb.Net-using-MD5-Hash

0

精彩评论

暂无评论...
验证码 换一张
取 消