Use Cases

Lead Generation with Web Scraping: The Ultimate B2B Guide

Master B2B lead generation using web scraping. Learn to extract business contacts from LinkedIn, Google Maps, directories, and build targeted prospect lists.

14 min read

Business lead generation prospect list built from web-scraped data

As an Apify affiliate, we may earn a commission from qualifying purchases made through our links, at no extra cost to you. We only recommend tools we believe in.

import BlogCTA from ’../../components/BlogCTA.astro’;

In B2B sales, your pipeline is only as good as your leads. The problem? Quality leads are expensive, time-consuming to find, and often outdated by the time you reach out. With the lead generation market projected to reach $295 billion by 2027, competition for quality prospects is fierce. Web scraping changes the game by automating the entire lead generation process.

This guide shows you how to build a scalable lead generation system using web scraping—from identifying sources to enriching and validating your data.

The Lead Generation Challenge

Traditional Methods Are Broken

2025 Cost Per Lead Benchmarks:

MethodCost per LeadQualityScale
Manual research$50-100HighVery Low
Purchased lists$0.10-1Low-MediumHigh
Inbound marketing$30-50HighMedium
LinkedIn Ads$110-408HighMedium
Google Ads$70.11 (up 5% from 2024)MediumHigh
Web scraping$0.01-0.10HighVery High

The average B2B cost per lead across all industries ranges from $84-198 in 2025, making web scraping 100-1000x more cost-effective.

Why Web Scraping Wins

  1. Fresh data - Extract current information, not 6-month-old lists
  2. Custom targeting - Define exact criteria for ideal customers
  3. Scalability - Generate thousands of leads per hour
  4. Cost efficiency - 10-100x cheaper than buying lists
  5. Competitive edge - Access data others can’t buy

Best Sources for B2B Leads

1. Google Maps & Local Business Listings

Best for: Local services, retail, restaurants, professional services

Data available:

  • Business name and address
  • Phone number (often direct line)
  • Website URL
  • Business hours
  • Reviews and ratings
  • Owner responses to reviews

Use case example: An accounting firm targeting restaurants in Miami:

Search: "restaurants Miami FL"
Extract: Name, phone, website, review count
Filter: 50+ reviews, 4+ star rating
Result: 500 qualified prospects

2. LinkedIn (Company & People)

Best for: B2B SaaS, consulting, enterprise sales

Data available:

  • Company size and industry
  • Decision-maker names and titles
  • Work history and education
  • Company updates and job postings
  • Technologies used (from job descriptions)

Legal Note (2025): The hiQ Labs v. LinkedIn ruling confirms that scraping publicly available data does not violate the Computer Fraud and Abuse Act (CFAA). However, it may still violate LinkedIn’s Terms of Service, so use caution and consider API alternatives where available.

Pro tip: Job postings reveal pain points. A company hiring for “CRM administrator” needs CRM solutions.

3. Industry Directories

Best for: Niche markets, regulated industries

Examples:

  • Clutch.co (agencies and IT services)
  • Crunchbase (startups and tech companies)
  • Yellow Pages (local businesses)
  • Industry-specific directories (legal, medical, etc.)

4. E-commerce Platforms

Best for: Selling to online sellers, SaaS for e-commerce

Platforms to scrape:

  • Amazon seller storefronts
  • Etsy shops
  • Shopify store directories
  • eBay power sellers

5. Job Boards

Best for: HR tech, recruiting, B2B services

Data signals:

  • Company is hiring = Company is growing
  • Specific roles = Specific needs
  • Multiple openings = Well-funded

6. Company Websites

Best for: Contact information, technology detection

Extract:

  • Contact page details
  • Team/About page (decision makers)
  • Technology stack (from source code)
  • Pricing pages (budget indicators)

Building Your Lead Generation Pipeline

Phase 1: Define Your Ideal Customer Profile (ICP)

Before scraping, define who you’re looking for:

Company Criteria:
- Industry: SaaS, Technology
- Size: 50-500 employees
- Location: United States
- Revenue: $5M-50M ARR
- Signals: Recently funded, hiring

Contact Criteria:
- Title: VP Sales, Head of Sales, Sales Director
- Decision level: Director+
- LinkedIn presence: Active

Phase 2: Source Selection

Match sources to your ICP:

ICP CharacteristicBest Source
Local businessesGoogle Maps
Tech companiesCrunchbase, LinkedIn
E-commerce brandsShopify directories
Professional servicesIndustry directories
Growing companiesJob boards

Phase 3: Data Extraction

Use our Google Maps Scraper or Contact Extractor to:

  1. Search by criteria - Industry, location, keywords
  2. Extract base data - Names, addresses, websites
  3. Collect contact info - Phones, emails, social links
  4. Capture signals - Reviews, ratings, activity

Phase 4: Data Enrichment

Raw scraped data needs enhancement:

Enrich with:

  • Email addresses (from websites)
  • LinkedIn profiles (from company names)
  • Company size (from LinkedIn)
  • Technologies used (from BuiltWith, Wappalyzer)
  • Funding data (from Crunchbase)

Phase 5: Validation & Scoring

Before outreach, validate and score leads:

Validation checks:

  • ✅ Email deliverability (use verification tools)
  • ✅ Phone number format and validity
  • ✅ Website is active
  • ✅ Company still in business

Scoring model:

Lead Score = 
  (Company Size Match × 20) +
  (Industry Match × 25) +
  (Title Seniority × 20) +
  (Engagement Signals × 15) +
  (Data Completeness × 10) +
  (Recency × 10)

Step-by-Step: Google Maps Lead Extraction

Example: Finding Dental Practices for a Dental Software Company

Step 1: Configure search

Search Query: "dental clinic" OR "dentist office"
Location: Los Angeles, CA
Radius: 50 miles
Max Results: 1000

Step 2: Select data fields

  • ✅ Business name
  • ✅ Address
  • ✅ Phone number
  • ✅ Website
  • ✅ Rating
  • ✅ Review count
  • ✅ Business hours

Step 3: Run extraction Processing time: ~10-15 minutes for 1000 results

Step 4: Export and analyze

Total results: 847 dental practices
With websites: 723 (85%)
With phone: 841 (99%)
4+ star rating: 612 (72%)
50+ reviews: 389 (46%)

Step 5: Filter for quality Final qualified leads: 389 practices with:

  • 4+ star rating (quality indicator)
  • 50+ reviews (established practice)
  • Website (can research before calling)

Email Extraction Strategies

From Company Websites

Use the Contact Extractor to:

  1. Crawl contact pages - /contact, /about, /team
  2. Extract email patterns - info@, sales@, [name]@
  3. Identify decision makers - Team pages with names
  4. Capture forms - Contact forms indicate preferred channels

Email Pattern Generation

If you have names but no emails, use common patterns:

PatternExample
first@company.comjohn@acme.com
first.last@company.comjohn.smith@acme.com
flast@company.comjsmith@acme.com
firstl@company.comjohns@acme.com

Verify before sending using email verification services.

Compliance and Ethics

GDPR Considerations

If targeting EU businesses:

  • ✅ Business contact data is generally compliant
  • ✅ B2B outreach has legitimate interest basis
  • ❌ Don’t scrape personal/consumer data
  • ⚠️ Honor unsubscribe requests immediately

CAN-SPAM Requirements

For US email outreach:

  • ✅ Include physical address
  • ✅ Clear unsubscribe mechanism
  • ✅ Honest subject lines
  • ✅ Identify message as ad if applicable

Best Practices

  1. Only use business data - Not personal information
  2. Respect robots.txt - Follow website guidelines
  3. Don’t spam - Quality over quantity
  4. Provide value - Your outreach should help, not annoy
  5. Honor opt-outs - Immediately and permanently

Measuring Lead Generation ROI

Key Metrics

MetricFormula2025 Benchmark
Cost per lead (paid ads)Total cost ÷ Leads generated$84-198 (B2B avg)
Cost per lead (scraping)Total cost ÷ Leads generated$0.01-0.10
Lead-to-MQL rateMQLs ÷ Total leads15-30%
MQL-to-SQL rateSQLs ÷ MQLs30-50%
SQL-to-close rateCustomers ÷ SQLs15-25%
Customer acquisition costTotal cost ÷ CustomersVaries by industry

Example ROI Calculation

Web scraping cost: $100/month
Leads generated: 2,000
Cost per lead: $0.05

Conversion funnel:
- 2,000 leads
- 400 MQLs (20%)
- 160 SQLs (40%)
- 32 customers (20%)

Average deal value: $5,000
Revenue: $160,000
ROI: 160,000 ÷ 100 = 1,600x

Integration with Sales Tools

CRM Integration

Export leads directly to:

  • Salesforce - Via CSV import or API
  • HubSpot - Native integrations available
  • Pipedrive - Bulk import feature
  • Zoho CRM - Multiple import options

Outreach Automation

Connect with:

  • Outreach.io - Automated sequences
  • Apollo.io - Email and call cadences
  • Lemlist - Personalized email campaigns
  • Reply.io - Multi-channel outreach

Data Orchestration

Use automation platforms:

  • Zapier - Connect scrapers to CRMs
  • Make - Complex workflow automation
  • n8n - Self-hosted automation

Common Mistakes to Avoid

  1. Quantity over quality - 100 good leads beat 1,000 bad ones
  2. No validation - Sending to invalid emails hurts deliverability
  3. Generic outreach - Personalization is essential
  4. Ignoring signals - Recent activity = better timing
  5. One-and-done scraping - Regular updates keep data fresh
  6. Legal ignorance - Know your compliance requirements

Getting Started Checklist

Week 1: Foundation

  • Define your ICP (Ideal Customer Profile)
  • Choose 2-3 data sources to start
  • Set up a scraper account
  • Create your first data extraction

Week 2: Process

  • Build enrichment workflow
  • Set up email validation
  • Create lead scoring model
  • Design CRM import process

Week 3: Activation

  • Import first batch to CRM
  • Launch initial outreach
  • Track response rates
  • Iterate on messaging

Ongoing

  • Schedule weekly scrapes
  • A/B test outreach approaches
  • Refine targeting criteria
  • Expand to new sources

Ready to supercharge your lead generation? Start with our Google Maps Scraper for local businesses or Contact Extractor for website data.

Need a custom lead generation solution? Contact us to discuss your specific requirements.

Share this:

Tags

#lead generation #b2b #sales #prospecting #business data #linkedin
✍️

ParseFlow

Automation Expert & Technical Founder

Specializing in web scraping, browser automation, and data harvesting solutions. Helping businesses scale with automated insights.