Discovering Data
  • Home
  • Blog

Compare document similarities

6/16/2017

0 Comments

 
The following script will take a document and compare it to a set of documents to find the document similarities.

comparison document
  • The default installation directory for Icecream Ebook Reader is Icecream Ebook Reader with weak folder permissions that grants EVERYONE modify privileges to the contents of the 

the documents
  • directory and it's subfolders. This allows an attacker opportunity for their own code execution under any other user running the application."
  • "An insecure file permissions vulnerability has been discovered in the official Icecream Ebook Reader v4.53 software. The vulnerability allows local attackers with system user accounts to 
  • elevate the access to higher system privileges."
  • "A persistent cross site scripting web vulnerability has been discovered in the official Zenario v7.6 content management system."
  • "While performing network level testing of various Google applications, we discovered that the content for the application did not use SSL."
The Script

    
output of the script:
[ 0.48266575  0.          0.01086096  0.13409612  0.17690402]
So the comparison document most closely matches the first  document. The least similar is the second document with a score of two.

0 Comments



Leave a Reply.

    This blog includes:

    Scripts mainly in Python with a few in R covering NLP, Pandas, Matplotlib and others. See the home page for links to some of the scripts.  Also includes some explanations of basic data science terminology.

    Archives

    October 2018
    June 2018
    April 2018
    June 2017
    April 2017
    March 2017
    February 2017
    January 2017
    November 2016
    September 2016
    July 2016
    June 2016
    May 2016
    November 2015
    November 2014

    RSS Feed

Proudly powered by Weebly
  • Home
  • Blog