How It Works — Bulk Contact Extractor

What the tool does

You upload a file — a CSV, Excel spreadsheet, or text file — containing a list of website URLs or company domains. The tool visits each website, finds publicly visible contact information, and returns a single downloadable file with one row per domain.

Each row contains the domain, company name, all emails found, phone numbers, postal addresses, social profile links, and the page URLs where the data was discovered.

The crawling process, step by step

Homepage fetch — The tool fetches the homepage and extracts any contact information it contains. It also collects all internal links.
Priority page probing — It automatically tries 18 known contact slugs: /contact, /contact-us, /about, /about-us, /impressum, /imprint, /team, /locations, and several others.
Keyword link follow — From the links found on the homepage, it follows any that contain words like "contact", "about", "support", "office", "reach us", or "get in touch".
Depth and page cap — The crawl goes at most 2 levels deep and fetches at most 10 pages per domain.
Merge and deduplicate — Contact data from all visited pages is merged into a single record. Duplicate emails and phone numbers are removed automatically.
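
The steps above can be sketched roughly as follows. This is a minimal illustration, not the tool's actual code: the crawl function, the injected fetch callable, and the LinkCollector helper are hypothetical names. Only the slugs named in the text are listed. The sketch assumes keywords are matched against both the link URL and the link text, and that failed probes do not count toward the page cap.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

MAX_DEPTH = 2   # "at most 2 levels deep"
MAX_PAGES = 10  # "at most 10 pages per domain"
# Only the slugs named in the text; the full list of 18 is not given there.
PRIORITY_SLUGS = ["/contact", "/contact-us", "/about", "/about-us",
                  "/impressum", "/imprint", "/team", "/locations"]
KEYWORDS = ("contact", "about", "support", "office", "reach us", "get in touch")


class LinkCollector(HTMLParser):
    """Collects (href, link text) pairs from anchor tags."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text)))
            self._href = None


def crawl(domain, fetch):
    """Return {url: html} for one domain.

    fetch(url) is a hypothetical callable returning an HTML string, or None
    when the page is missing or blocked. Injecting it keeps the sketch testable.
    """
    home = f"https://{domain}/"
    queue = deque([(home, 0)])                                          # homepage first
    queue.extend((urljoin(home, slug), 1) for slug in PRIORITY_SLUGS)   # then known slugs
    seen, pages = set(), {}
    while queue and len(pages) < MAX_PAGES:
        url, depth = queue.popleft()
        if url in seen:
            continue
        seen.add(url)
        html = fetch(url)
        if html is None:
            continue  # assumption: failed probes do not count toward the cap
        pages[url] = html
        if depth >= MAX_DEPTH:
            continue  # do not follow links from pages at the depth limit
        collector = LinkCollector()
        collector.feed(html)
        for href, text in collector.links:
            target = urljoin(url, href)
            internal = urlparse(target).netloc == urlparse(home).netloc
            haystack = f"{target} {text}".lower()
            if internal and any(word in haystack for word in KEYWORDS):
                queue.append((target, depth + 1))
    return pages
```

Each page returned by such a crawl would then be passed to the extractors, and the per-page results merged into one record per domain.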

What data is extracted

Email addresses

Sources: mailto: links, visible page text, and JSON-LD structured data. Noreply, postmaster, bounce, and automated addresses are filtered out.
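
As a rough illustration of this kind of extraction and filtering, the sketch below scans raw markup, which crudely covers mailto: links, visible text, and JSON-LD in one pass. The regular expressions, the extract_emails name, and the list of blocked prefixes are assumptions made for the example. The tool's real patterns and filter list are not documented here.

```python
import re
from urllib.parse import unquote

# Deliberately simple pattern; real-world address syntax is broader than this.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
# Captures the address part of a mailto: link, stopping before any ?subject=... query.
MAILTO_RE = re.compile(r"mailto:([^\"'?>\s]+)", re.IGNORECASE)
# Assumed prefixes for "noreply, postmaster, bounce, and automated" addresses.
BLOCKED_PREFIXES = ("noreply", "no-reply", "donotreply", "postmaster",
                    "bounce", "mailer-daemon")


def extract_emails(html):
    """Return unique, lower-cased addresses in order of first appearance."""
    candidates = [unquote(match) for match in MAILTO_RE.findall(html)]
    candidates += EMAIL_RE.findall(html)
    result = []
    for raw in candidates:
        email = raw.strip().lower()
        if not EMAIL_RE.fullmatch(email):
            continue  # malformed mailto: target
        local_part = email.split("@", 1)[0]
        if local_part.startswith(BLOCKED_PREFIXES):
            continue  # automated sender, filtered out
        if email not in result:
            result.append(email)
    return result
```

Lower-casing before comparison means Sales@Example.com and sales@example.com collapse into a single entry, which is one simple way to get the deduplication described earlier.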

Phone numbers

Sources: tel: links, Schema.org telephone fields, and visible text. Every candidate is validated with the libphonenumber library and normalised to international format.

Postal addresses

Sources: Schema.org PostalAddress structured data, <address> HTML elements, and footer text blocks containing a postal code pattern.
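
The structured-data source is the most mechanical of the three, so here is a minimal sketch of it. It assumes the Schema.org data is embedded as JSON-LD in script tags. The JsonLdCollector, find_addresses, and extract_postal_addresses names are hypothetical, and the chosen field order is only one reasonable way to flatten an address into a single line.

```python
import json
from html.parser import HTMLParser

# Standard Schema.org PostalAddress properties, in the order used for output.
FIELDS = ("streetAddress", "postalCode", "addressLocality",
          "addressRegion", "addressCountry")


class JsonLdCollector(HTMLParser):
    """Collects the raw text of every <script type="application/ld+json"> block."""

    def __init__(self):
        super().__init__()
        self.blocks = []
        self._inside = False
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._inside = True
            self._buffer = []

    def handle_data(self, data):
        if self._inside:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._inside:
            self.blocks.append("".join(self._buffer))
            self._inside = False


def find_addresses(node):
    """Recursively yield every dict whose @type is PostalAddress."""
    if isinstance(node, dict):
        if node.get("@type") == "PostalAddress":
            yield node
        for value in node.values():
            yield from find_addresses(value)
    elif isinstance(node, list):
        for item in node:
            yield from find_addresses(item)


def extract_postal_addresses(html):
    """Return unique one-line addresses found in JSON-LD blocks."""
    collector = JsonLdCollector()
    collector.feed(html)
    addresses = []
    for block in collector.blocks:
        try:
            data = json.loads(block)
        except json.JSONDecodeError:
            continue  # skip malformed structured data
        for address in find_addresses(data):
            # Keep only plain string fields; nested objects are ignored here.
            parts = [address[f].strip() for f in FIELDS
                     if isinstance(address.get(f), str) and address[f].strip()]
            line = ", ".join(parts)
            if line and line not in addresses:
                addresses.append(line)
    return addresses
```

The <address> element and footer-text sources would need separate heuristics, which this sketch does not attempt.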

Social profiles

LinkedIn, Twitter/X, Facebook, Instagram, GitHub, YouTube, and TikTok links are collected. Share buttons and login links are excluded — only profile URLs are kept.
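
One plausible way to separate profile links from share buttons and login links is to check the host and then the first path segment. The sketch below does that. The classify_social name, the domain table, and the set of excluded path segments are assumptions for illustration and are unlikely to match the tool's real rules exactly.

```python
from urllib.parse import urlparse

# Domain -> platform label for the platforms listed above.
PLATFORMS = {
    "linkedin.com": "linkedin",
    "twitter.com": "twitter",
    "x.com": "twitter",
    "facebook.com": "facebook",
    "instagram.com": "instagram",
    "github.com": "github",
    "youtube.com": "youtube",
    "tiktok.com": "tiktok",
}
# Assumed first path segments that indicate share, login, or other non-profile URLs.
NON_PROFILE_SEGMENTS = {"share", "sharer", "sharer.php", "sharing", "sharearticle",
                        "intent", "dialog", "login", "signup", "hashtag", "search"}


def classify_social(url):
    """Return the platform label for a profile URL, or None if it is not one."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    platform = next(
        (label for domain, label in PLATFORMS.items()
         if host == domain or host.endswith("." + domain)),
        None,
    )
    if platform is None:
        return None
    segments = [part.lower() for part in parsed.path.split("/") if part]
    if not segments or segments[0] in NON_PROFILE_SEGMENTS:
        return None  # bare homepage, share button, or login link
    return platform
```

For example, classify_social("https://www.linkedin.com/company/acme") returns "linkedin", while a link to https://twitter.com/intent/tweet is rejected because its first path segment is in the excluded set.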

Known limitations

No JavaScript rendering — Pages are fetched without executing JavaScript. Emails loaded by JS will not be found.
Blocked by CAPTCHA or rate limiting — Some websites block automated crawlers. The output includes a blocked status for these.
Contact forms only — If a company uses a contact form and has no email visible, no email will be found.

Privacy and data handling

Your uploaded file is stored temporarily and used only to run the extraction. It is never shared with third parties, never used for advertising, and automatically deleted after 7 days.

Technical limits

Maximum file size: 50 MB
Maximum pages per domain: 10
Concurrent domains: 10
Rate limit: 10 uploads per hour per IP
Result retention: 7 days
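
To illustrate what a limit of 10 concurrent domains means in practice, here is a minimal sketch using an asyncio semaphore. The process_all and process_domain names are hypothetical, and the tool may enforce this limit in a different way.

```python
import asyncio

MAX_CONCURRENT_DOMAINS = 10  # "Concurrent domains: 10"


async def process_all(domains, process_domain):
    """Run process_domain(domain) for every domain, at most 10 at a time.

    process_domain is a hypothetical coroutine function that crawls one domain
    and returns its merged record. Results keep the input order.
    """
    semaphore = asyncio.Semaphore(MAX_CONCURRENT_DOMAINS)

    async def guarded(domain):
        async with semaphore:  # wait here while 10 domains are already in flight
            return await process_domain(domain)

    return await asyncio.gather(*(guarded(d) for d in domains))
```

With this pattern a large upload simply queues behind the semaphore, so the number of domains in the file affects total run time rather than the load placed on any one moment.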