5M+ Opportunities
Found for freelancers, driving 10x inventory growth
30k+ Nodes
Active distributed scraping nodes
Zero Ops
Serverless architecture with no scraping infra
The Inventory Bottleneck
Contra, a commission-free freelance marketplace, faced a classic chicken-and-egg problem: liquidity. To attract freelancers, they needed a massive volume of high-quality job opportunities. To attract clients, they needed active freelancers. Traditional scraping approaches were non-viable:- Platform Defenses: LinkedIn and X (Twitter) aggressively block data center IPs.
- Cost: Maintaining a proxy rotation infrastructure for millions of pages is prohibitively expensive.
- Context: Generic scrapers lack the “social graph” context required to find relevant opportunities.
Solution: Distributed Intelligence
We inverted the scraping model. Instead of a centralized server farm, we leveraged the users themselves. We built Indy.ai, a Chrome extension that acts as a distributed edge node for opportunity discovery.System Architecture
The system relies on a “parasitic” architecture (in the symbiotic sense) where the extension piggybacks on the user’s authenticated session to read content they are already viewing or that is available to them.Engineering Spotlight: Invisible Authentication
One of the critical UX requirements was “zero-config”. We didn’t want users to have to paste API keys or cookies. We implemented a seamless session detection mechanism. The extension monitors network requests to specific domains (linkedin.com, twitter.com) to passively detect active sessions.
[!NOTE] Privacy First: All data extraction happens locally or via ephemeral processing. We strictly adhere to a “read-only” policy regarding user credentials—we never store or transmit session cookies to our servers.
The Impact
The results were immediate and compounding. Because every new user brings their own unique social graph and IP address, the system scales linearly with user growth.- 5 Million+ Opportunities Found: We successfully scaled inventory by 10x, solving the marketplace liquidity problem.
- 30,000+ Active Nodes: Effectively creating a massive, distributed, residential proxy network for free.
- High Fidelity: Because data is sourced from real user feeds, it contains high-intent opportunities often invisible to public scrapers.
- 4.6/5 Star Rating: By bundling this utility with genuine value for the freelancer (automated job matching), we achieved high retention.
“Doug operated quickly & efficiently, and even proposed ways to improve the feature to exceed our expectations. 10/10!” — Allison Nulty, Head of Product, Contra
Why This Matters
This architecture demonstrates a shift from Centralized Extraction to Edge Discovery. By distributing the workload to the edge (the user’s browser), we bypassed the scaling limitations of traditional scraping and built a self-reinforcing data moat.View on Chrome Web Store
Join 30,000+ freelancers using Indy.ai