TextSure

A Document Similarity & Text Comparison Tool

TextSure is a web-based document similarity checker that compares two documents and determines how similar they are using Natural Language Processing (NLP) techniques. It supports sentence-level similarity detection and provides a clean, user-friendly interface.

The project is built using Flask (Python) for the backend and HTML/CSS/JavaScript for the frontend, and is deployed on Render.


Features

  • Upload and compare two documents
  • Overall similarity percentage using TF-IDF & cosine similarity
  • Sentence-level similarity detection
  • Simple and responsive UI
  • Deployed to the cloud on Render
  • Secure file handling (temporary storage & cleanup)
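
The temporary storage and cleanup idea can be illustrated with a minimal sketch. This is not the project's actual code: `temporary_upload` is a hypothetical helper, and it assumes the uploaded file's bytes are already in memory (in Flask they could come from calling `.read()` on an uploaded file object).

```python
import os
import tempfile
from contextlib import contextmanager


@contextmanager
def temporary_upload(data: bytes, suffix: str = ""):
    """Write uploaded bytes to a temp file and always delete it afterwards.

    Hypothetical helper for illustration; the real app may handle this differently.
    """
    # mkstemp creates the file with a unique, unpredictable name,
    # so a user-supplied filename never reaches the filesystem.
    fd, path = tempfile.mkstemp(suffix=suffix)
    try:
        with os.fdopen(fd, "wb") as handle:
            handle.write(data)
        yield path
    finally:
        # Cleanup runs even if processing the file raised an exception.
        if os.path.exists(path):
            os.remove(path)


if __name__ == "__main__":
    with temporary_upload(b"sample text", suffix=".txt") as path:
        print("stored at", path)
    print("still exists:", os.path.exists(path))
```

Keeping the original extension as the suffix lets later steps pick an extractor by file type, while the generated name avoids trusting the client's filename.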

Tech Stack

Backend

  • Python
  • Flask
  • Flask-CORS
  • scikit-learn (TF-IDF, cosine similarity)
  • pdfplumber
  • python-docx
  • BeautifulSoup

Frontend

  • HTML5
  • CSS3
  • Vanilla JavaScript (Fetch API)

Deployment

  • Render (Free Tier)
  • Gunicorn

How It Works

  1. User uploads two documents
  2. Text is extracted from each file
  3. Text is normalized and cleaned
  4. TF-IDF vectors are generated
  5. Cosine similarity is calculated
  6. If similarity is high, similar sentences are highlighted
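
The steps above can be sketched with scikit-learn roughly as follows. This is an illustrative sketch, not the project's actual implementation: the function names, the normalization rules, the regex-based sentence splitter, and the 0.8 sentence threshold are all assumptions made for the example.

```python
import re

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity


def normalize(text: str) -> str:
    """Step 3 (assumed rules): lowercase, drop punctuation, collapse whitespace."""
    text = re.sub(r"[^\w\s]", " ", text.lower())
    return re.sub(r"\s+", " ", text).strip()


def document_similarity(text_a: str, text_b: str) -> float:
    """Steps 4-5: TF-IDF vectors, then cosine similarity as a percentage.

    TfidfVectorizer raises ValueError if neither text contains a usable token.
    """
    vectors = TfidfVectorizer().fit_transform([normalize(text_a), normalize(text_b)])
    score = cosine_similarity(vectors[0], vectors[1])[0, 0]
    return round(float(score) * 100, 2)


def split_sentences(text: str) -> list:
    """Naive splitter on ., ! and ? (an assumption; real text needs more care)."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def similar_sentences(text_a: str, text_b: str, threshold: float = 0.8) -> list:
    """Step 6: for each sentence in A, report its best match in B if above threshold."""
    sents_a, sents_b = split_sentences(text_a), split_sentences(text_b)
    if not sents_a or not sents_b:
        return []
    # Fit one vocabulary over all sentences so both sides share the same features.
    vectorizer = TfidfVectorizer().fit([normalize(s) for s in sents_a + sents_b])
    matrix = cosine_similarity(
        vectorizer.transform([normalize(s) for s in sents_a]),
        vectorizer.transform([normalize(s) for s in sents_b]),
    )
    pairs = []
    for i, row in enumerate(matrix):
        j = int(row.argmax())
        if row[j] >= threshold:
            pairs.append((sents_a[i], sents_b[j], float(row[j])))
    return pairs


if __name__ == "__main__":
    doc_a = "The cat sat on the mat. Dogs bark loudly."
    doc_b = "Birds can fly high. The cat sat on the mat."
    print(document_similarity(doc_a, doc_b))
    print(similar_sentences(doc_a, doc_b))
```

Cosine similarity compares the direction of the two TF-IDF vectors rather than their length, so a long document and a short one that use the same vocabulary can still score as similar.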

Supported File Types

File Type            Status
.txt                 Supported
.docx                Supported
.pdf                 Supported (small files)
Images (.png, .jpg)  Disabled on cloud

⚠️ Image OCR is disabled on the deployed version due to cloud infrastructure limitations.
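
As a rough illustration of how extraction might be dispatched by file type, here is a minimal sketch using pdfplumber and python-docx from the tech stack. The `extract_text` function is a hypothetical name, and the project's real extraction code may differ. The third-party imports are placed inside the branches so that plain-text files work even if those packages are missing.

```python
from pathlib import Path


def extract_text(path: str) -> str:
    """Return the text of a .txt, .docx, or .pdf file (hypothetical helper)."""
    suffix = Path(path).suffix.lower()

    if suffix == ".txt":
        # Replace undecodable bytes instead of failing on odd encodings.
        return Path(path).read_text(encoding="utf-8", errors="replace")

    if suffix == ".docx":
        from docx import Document  # provided by the python-docx package

        return "\n".join(p.text for p in Document(path).paragraphs)

    if suffix == ".pdf":
        import pdfplumber

        with pdfplumber.open(path) as pdf:
            # extract_text() returns None for pages with no text layer.
            return "\n".join(page.extract_text() or "" for page in pdf.pages)

    # Images and anything else are rejected, matching the table above.
    raise ValueError(f"Unsupported file type: {suffix or '(none)'}")
```

Page-by-page PDF extraction is the expensive branch, which is consistent with large PDFs being the ones that hit limits on a small hosted instance.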


Live Demo

🔗 Live URL:
https://textsure.onrender.com

Note: On Render’s free tier, large PDF files may fail due to CPU and timeout limits. This is a limitation of the hosting tier, not of the comparison logic itself.


🧪 Local Setup (Run on Your Machine)

Clone the Repository

git clone https://github.com/Vrinda2403/TextSure.git
cd TextSure

Create and Activate a Virtual Environment

python -m venv venv

On Windows:

venv\Scripts\activate

On macOS or Linux:

source venv/bin/activate

Install Dependencies

pip install -r requirements.txt

Run the App

python app.py
