GitHub - Cvikli/DiffLib.jl: Creating diff that supports wildcard produced by LLMs

DiffLib.jl

Parse LLM's codeblock and let's create a git diff against your own codeblock. That is why this diff support WILDCARDS too!

Improved diff: this tool propose CHARACTER and LINE based diff based on the modification amount and percentage.

NOTE: LLMs can create even better diffs with their wildcard. So all in all I suggest to create the extended version of the file with an LLM diff and then run this script to get very nice diffs.

Installation

using Pkg
Pkg.add(url="https://github.com/Cvikli/DiffLib.jl")

DEMO & Usage

julia -e "using DiffLib; run_cli()" test_cases/case0.js test_cases/case0_changes.js -d -w "// ..."

or get the diff like git diff --word-diff does:

julia -e "using DiffLib; run_cli()" test_cases/case0.js test_cases/case0_changes.js -w "// ..."

Or in code:

using DiffLib

# Compare two files
diff_files("test_cases/case0.js", "test_cases/case0_changes.js", "// ... ")

# Compare content strings
diff_contents(original_content, changed_content, ["WILDCARD"])

Features

LLM codeblock output + original codeblock diff
The diff is Word-based and character-based diff
Wildcard support for flexible matching
CLI for easy file comparison from terminal
Customizable output formatting by setting threashold of char or line based diff usage

REASON

LLMs can generate abbreviations, also these can be forced to be generated to faster output:
- // ... existing code ...
- // ... existing imports ...
- // ... rest of the component ...
- // ... rest of the component remains the same
- // ... rest of the existing styles ...
- // ... rest of the existing code ...
- // ... (rest of the code remains unchanged)
- // ... other styled components remain the same
- // ... (previous code remains unchanged)
- // ... imports remain the same
- // ... rest of the component (remove any font-size: 20px - declarations) ...
- // ... (keep other code unchanged)
- // ... (keep other styled components and imports unchanged)
- // ... existing JSX ...
- // ... existing useEffect and functions ...
- // ... (keep existing state variables)
- // ... (keep existing values)
- // ... (keep existing code)
- // ... (keep existing dependencies)
- // ... existing error handling ...
- // ... rest of the component ...
- // ... (previous dependencies)
- // ... (previous code)
- // ... (previous values)
- // ... (rest of the file)

This sounds pretty impossible to parse in each case. So I made this beginning match to be the pattern // ... . If only one string is defined then we use the startswith(wildcard, line)

The git diff often fail to find the diff... also many other diff fails in case of LLMs output.
Also why don't we have more granular diff like word or even character based diff... why should we look for a whole line to find the changes? right? We are humans with limited cognitive speed. :D

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
src		src
test		test
test_cases		test_cases
.gitignore		.gitignore
LICENSE		LICENSE
Project.toml		Project.toml
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DiffLib.jl

Installation

DEMO & Usage

Features

REASON

License

TODO

How was this created?

About

Releases

Packages

Languages

License

Cvikli/DiffLib.jl

Folders and files

Latest commit

History

Repository files navigation

DiffLib.jl

Installation

DEMO & Usage

Features

REASON

License

TODO

How was this created?

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages