About This Tool
This tool allows you to analyze, filter, and cluster numeric frequency data based on percentage difference thresholds. It provides three different algorithms to process your data:
Filter Similar Numbers
Removes numbers that are too close to each other based on percentage difference threshold.
Cluster Similar Numbers
Groups numbers that have at least one neighbor within the threshold, discarding isolated numbers.
Find Largest Similar Group
Finds the largest group where every number differs less than the threshold from all others in the group.
Upload & Configure
How It Works
Filter Similar Numbers
Start with an ordered list of all numbers
Add the first number to the filtered set
For each subsequent number, check if it differs from all previously kept numbers by at least the threshold
If it differs enough, keep it; otherwise, discard it
Cluster Similar Numbers
Compare each number with all others to find similar pairs (difference < threshold)
Create initial clusters where each number has at least one similar neighbor
Merge clusters that have similar members between them
Keep only clusters with at least 2 members, discarding isolated numbers
Find Largest Similar Group
For each number in the dataset, try to build a group starting from that number
Add other numbers to the group only if they differ from all current group members by less than the threshold
Verify that every pair of numbers in the resulting group has a difference less than the threshold
Keep the largest valid group found across all starting numbers
Privacy and Data Protection
Data Management Information
This tool is designed to protect your privacy and ensure the security of the data you upload.
Automatic File Deletion
- Results, log, and test files are immediately deleted after download
- The original file is deleted after you have downloaded all results
- All files are automatically deleted after 24 hours
Security and Transparency
- Download buttons become disabled after use for visual feedback
- No data is shared with external services or third parties
- All processing takes place on the local server without external dependencies
Liability Disclaimer
This tool is provided "as is" without warranties of any kind, either expressed or implied.
- The use of this service is at the user's own risk and responsibility
- We assume no responsibility for the data entered, the accuracy of the results, or any consequences arising from the use of this tool
- By uploading files to this tool, the user assumes all legal responsibilities related to the content of the files and decisions made based on the results