Question 1

How does text comparison work?

Accepted Answer

The tool uses a Longest Common Subsequence (LCS) diff algorithm. It tokenizes both inputs into the smallest units chosen by your comparison mode (lines, words, paragraphs, or characters), then computes the minimal edit sequence that transforms the original into the modified version. Each token is classified as equal, added, removed, or modified, and the results are rendered side-by-side with colour-coded highlights.

Question 2

What does the similarity score mean?

Accepted Answer

Similarity is the ratio of unchanged tokens to total tokens, expressed as a percentage. 100% means the two inputs are identical (under your current ignore options). 0% means they share nothing in common. The same texts can score differently in different modes — two paragraphs that share most words but rearrange them will score higher in word mode than line mode, for example.

Question 3

What is merge mode?

Accepted Answer

Merge mode lets you resolve every difference between the two inputs by choosing which version to keep. For each conflict you can pick Accept Left (keep the original), Accept Right (take the modified version), or Accept Both (concatenate them). The Merged Output panel rebuilds the final document in real time as you make each choice — perfect for resolving competing edits from two reviewers or merging two drafts.

Question 4

Can I compare code files?

Accepted Answer

Yes — the tool reads any plain-text file under 1 MB, including SQL, JSON, XML, HTML, CSS, JavaScript, TypeScript, Markdown, YAML, and CSV. Use Line mode for line-by-line code diffs (the standard for git-style review) or Word mode to track inline edits like renames and signature changes. The diff is purely text-based — it doesn't parse code semantics, so reformatting will appear as a change.

Question 5

Which file types are supported?

Accepted Answer

TXT, CSV, JSON, XML, SQL, HTML, CSS, JS, JSX, TS, TSX, Markdown (.md), and YAML (.yml, .yaml). Anything that's plain text and under 1 MB will load. Binary files (images, PDFs, Word documents) are not supported — they would need to be converted to text first.

Question 6

How accurate is the comparison?

Accepted Answer

The LCS algorithm produces an optimal minimal edit script — it always finds the smallest set of changes that explains the difference. The 'feel' of the diff (whether two paragraphs read as 'modified' or 'one removed + one added') depends on your chosen mode and ignore options. For most use cases, Line mode with Ignore Whitespace gives the most intuitive code diffs; Word mode with Ignore Case gives the best results for prose.

Question 7

What are the ignore options for?

Accepted Answer

Ignore Whitespace masks spacing-only changes (great for code reformatting). Ignore Case masks letter-case-only changes (useful when comparing emails, identifiers, or content that's been recapitalized). Ignore Punctuation focuses on the words themselves — useful for prose comparison. Ignore Blank Lines skips empty separator lines, which is helpful when comparing documents with different vertical spacing.

Question 8

Why is the diff slow on very large files?

Accepted Answer

The LCS algorithm uses O(n × m) memory, where n and m are the token counts on each side. Two 1,000-line files compute instantly, but two 5,000-line files in character mode would need 25 million cells — which the tool blocks for safety. If you hit the size guard, switch to Line or Paragraph mode (much fewer tokens) or split the file into smaller sections.

Smart Text Compare & Merge

How to use

When to use text comparison

How it works

Frequently asked questions

Categories

Popular

Company

Features