Find and resolve duplicate records with fuzzy matching
Goes beyond exact matches to find duplicates that differ by typos, formatting, abbreviations, or missing fields. For example, "Jon Smith" at "123 Main St" and "John Smith" at "123 Main Street" are caught and grouped for your review.
Create a skill called "Dedup Detective". When I give you a dataset and specify the columns to match on, find duplicate records using: (1) Exact matching on normalized values (trim whitespace, lowercase, remove punctuation). (2) Phonetic matching (Soundex, Metaphone) for name columns. (3) Edit distance (Levenshtein) for strings that might have typos. (4) Token overlap for address-style fields ("123 Main Street" vs "123 Main St"). (5) Combination scoring across multiple columns. For each candidate duplicate group, assign a confidence score (0-1). Present groups for my review, sorted by confidence. Let me approve or reject each group. For approved groups, ask which record to keep as the master and how to combine fields from duplicates. Save the matching rules so I can rerun on future data without re-approving the same patterns.
Exact dedup is easy. Fuzzy dedup is where analysts lose days. This skill
uses multiple matching strategies (phonetic, edit distance, token overlap)
to find records that are probably the same entity, groups them, and lets
you review before merging.
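To make the matching strategies concrete, here is a minimal Python sketch of how normalization, Soundex, Levenshtein distance, and token overlap might combine into one confidence score. It is not the skill's actual implementation: the function names, the tiny abbreviation map, the 0.9 phonetic score, and the 50/50 column weights are all assumptions chosen for illustration.

```python
import re
import string

# Illustrative abbreviation map (an assumption, far from complete).
ABBREVIATIONS = {"st": "street", "ave": "avenue", "rd": "road"}

SOUNDEX_CODES = {
    ch: digit
    for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                           ("l", "4"), ("mn", "5"), ("r", "6"))
    for ch in letters
}

def normalize(value: str) -> str:
    """Trim, lowercase, drop punctuation, collapse runs of whitespace."""
    value = value.strip().lower()
    value = value.translate(str.maketrans("", "", string.punctuation))
    return re.sub(r"\s+", " ", value)

def soundex(name: str) -> str:
    """American Soundex code, e.g. 'Robert' -> 'R163'."""
    letters = [c for c in name.lower() if c.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = SOUNDEX_CODES.get(ch, "")
        if code and code != prev:
            result += code
        prev = code
    return (result + "000")[:4]

def levenshtein(a: str, b: str) -> int:
    """Edit distance (insert, delete, substitute), row by row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]

def name_score(a: str, b: str) -> float:
    """Best of edit-distance similarity and a fixed score for a phonetic match."""
    a, b = normalize(a), normalize(b)
    if not a or not b:
        return 0.0
    edit_sim = 1 - levenshtein(a, b) / max(len(a), len(b))
    phonetic = [soundex(t) for t in a.split()] == [soundex(t) for t in b.split()]
    return max(edit_sim, 0.9 if phonetic else 0.0)  # 0.9 is an assumed value

def address_score(a: str, b: str) -> float:
    """Jaccard overlap of tokens after expanding known abbreviations."""
    ta = {ABBREVIATIONS.get(t, t) for t in normalize(a).split()}
    tb = {ABBREVIATIONS.get(t, t) for t in normalize(b).split()}
    return len(ta & tb) / len(ta | tb) if ta and tb else 0.0

def match_confidence(rec_a: dict, rec_b: dict) -> float:
    """Combine per-column scores; equal weights are an assumption."""
    return (0.5 * name_score(rec_a["name"], rec_b["name"])
            + 0.5 * address_score(rec_a["address"], rec_b["address"]))

a = {"name": "Jon Smith", "address": "123 Main St"}
b = {"name": "John Smith", "address": "123 Main Street"}
print(round(match_confidence(a, b), 2))  # 0.95
```

Pairs scoring above a threshold you choose would form the candidate groups presented for review. Comparing every pair is quadratic in the number of records, so a larger dataset would first need to be bucketed, for example by Soundex code.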
Fix broken CSVs before they break your pipeline
Drop any CSV on your Claw and get a clean, validated file back. It detects encoding issues, mixed delimiters, malformed rows, broken quoting, BOM markers, and type mismatches, then fixes them and tells you what it changed.
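As a rough illustration of a few of these checks, the Python sketch below strips a UTF-8 BOM, falls back to cp1252 when the bytes are not valid UTF-8, guesses the delimiter from the header line, and sets aside rows whose field count does not match the header. The function name `clean_csv`, the candidate delimiter list, and the cp1252 fallback are assumptions for this example, not the skill's actual behavior. It also skips malformed rows rather than repairing them.

```python
import csv
import io

# Delimiters considered when guessing (an assumed, illustrative list).
CANDIDATE_DELIMITERS = [",", ";", "\t", "|"]

def clean_csv(raw: bytes):
    """Return (header, rows, report) for a raw CSV payload."""
    report = []
    if raw.startswith(b"\xef\xbb\xbf"):
        raw = raw[3:]
        report.append("removed UTF-8 BOM")
    try:
        text = raw.decode("utf-8")
    except UnicodeDecodeError:
        # Assumed fallback encoding; a real tool would try to detect it.
        text = raw.decode("cp1252", errors="replace")
        report.append("input was not valid UTF-8; decoded as cp1252")

    lines = text.splitlines()
    if not lines:
        return [], [], report + ["file is empty"]

    # Pick the candidate that occurs most often in the header line.
    delimiter = max(CANDIDATE_DELIMITERS, key=lines[0].count)
    report.append(f"delimiter guessed as {delimiter!r}")

    reader = csv.reader(io.StringIO(text, newline=""), delimiter=delimiter)
    header = next(reader)
    rows = []
    for row in reader:
        if not row:
            continue  # skip blank lines
        if len(row) != len(header):
            report.append(
                f"line {reader.line_num}: expected {len(header)} fields, got {len(row)}"
            )
            continue
        rows.append([cell.strip() for cell in row])
    return header, rows, report

raw = b"\xef\xbb\xbfid;name\n1;Ann\n2;Bob;extra\n3; Cy \n"
header, rows, report = clean_csv(raw)
print(rows)    # [['1', 'Ann'], ['3', 'Cy']]
print(report)
```

The report list plays the role of the change log described above: each entry records one thing that was altered or skipped.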
Turn messy spreadsheets into clean, analyzable data
Handles the spreadsheets from hell — merged cells, multiple header rows, color-coded data, footnotes mixed with values, data starting at cell B7. Extracts clean tabular data from Excel files without forcing you to reverse-engineer the formatting first.
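One piece of this, finding where the real table starts and stops, can be sketched in Python. The example assumes the sheet has already been read into a grid (a list of rows, with None for empty cells) by whatever spreadsheet reader you use. The function name `extract_table` and the "at least two filled cells" heuristic are assumptions for illustration; merged cells, stacked header rows, and color coding are not handled here.

```python
def extract_table(grid, min_filled=2):
    """Locate the header row in a sheet grid and return (header, rows).

    The header is taken to be the first row with at least `min_filled`
    non-empty cells; the table ends at the first later row with fewer
    than that (a blank line or a lone footnote). Both are heuristics.
    """
    def is_filled(cell):
        return cell not in (None, "")

    for top, header_row in enumerate(grid):
        cols = [i for i, cell in enumerate(header_row) if is_filled(cell)]
        if len(cols) >= min_filled:
            break
    else:
        return [], []  # no row looks like a header

    left, right = cols[0], cols[-1] + 1
    header = ["" if c is None else str(c).strip() for c in header_row[left:right]]

    rows = []
    for row in grid[top + 1:]:
        cells = list(row[left:right])
        cells += [None] * (len(header) - len(cells))  # pad short rows
        if sum(is_filled(c) for c in cells) < min_filled:
            break  # blank row or footnote: treat as end of table
        rows.append(cells)
    return header, rows

grid = [
    ["Quarterly report", None, None, None],
    [None, None, None, None],
    [None, "Region", "Units", "Revenue"],
    [None, "North", 10, 250.0],
    [None, "South", 7, 175.5],
    [None, "* preliminary figures", None, None],
]
header, rows = extract_table(grid)
print(header)  # ['Region', 'Units', 'Revenue']
print(rows)    # [['North', 10, 250.0], ['South', 7, 175.5]]
```

The title row, the empty padding column, and the trailing footnote are all dropped, leaving only the tabular block.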
Update pricing across all your Airbnb listings without clicking through each one
Automate bulk rate changes across multiple Airbnb listings using your Claw. Useful for seasonal pricing updates, last-minute discounts, or syncing rates after a change in your hosting strategy.
Email and calendar without leaving your terminal
Full Gmail and calendar control via the gog CLI. Read, send, search, and organize emails. Create events, set reminders, and RSVP to invitations. All from natural language or CLI commands.