开发者

MySQL query to check for certain phrases (duplicate article, plagiarism)

开发者 https://www.devze.com 2023-01-19 18:40 出处:网络
Is there a way to check for multiple phrases in mysql? I need to check if an article have a duplicate version stored in mysql.

Is there a way to check for multiple phrases in mysql?

I need to check if an article have a duplicate version stored in mysql.

This is the algorithm I first create an array of sentences that needed to be check (removing all non-alpha-numeric characters) Then Build the query (how?) After I get the result I compare if 50% of the sentences are duplicate than I consider the article to be duplicate. Articles in the table are stored with removed non-alpha-numeric characters. Example:

iamdevelopingatoolt开发者_C百科ocheckduplicatearticlesstoredinmysqldatabasehoweveriveencountered 

Any suggestions?


Yes, look into "Programming Collective Intelligence" to learn about these algorithms. They have to do with grouping.

0

精彩评论

暂无评论...
验证码 换一张
取 消