Overview:
SlickFlo is hiring a talented Web Scraper Developer to design and implement a high-performance, production-ready web scraping solutions for multiple websites. The ideal candidate should be an expert in building scalable, maintainable scraping tools using Python, with strong knowledge of Dockerized environments and experience running workloads inside Google Cloud Platform (GCP). This is a full-time position with remote environment.
Role Type: Full Time
Location: Remote
Timings: US Time Zone (Mostly EST)
Responsibilities:
- Develop robust and efficient Python-based web scraping scripts capable of running reliably in Docker containers on Google Cloud.
- Implement proxy rotation to ensure anonymity and scraping continuity, with randomized proxy selection per run.
- Diagnose and update scrapers as needed to accommodate website changes or other modifications, ensuring uninterrupted data extraction.
- Integrate Pydantic for structured data validation and schema management.
- Write clean, well-documented, and modular code with proper type hints and error handling.
- Optimize performance to handle dynamic websites, including those with JavaScript-rendered content (e.g., via Playwright, Selenium, Pydoll or Puppeteer-based frameworks).
- Configure environment and dependency management using uv or similar dependency managers.
- Ensure the script is easily deployable and configurable through Docker environment variables.
- Collaborate briefly during testing to validate the scraper’s correctness and performance.
- Create robust data pipelines to export data to NoSQL databases like MongoDB.
Skills & Qualifications:
- Strong proficiency in Python 3.9+.
- Proven experience (minimum 2 years) building web scraping tools using frameworks like Playwright, Selenium, Pydoll or equivalent, with support for headed mode inside Docker.
- Deep understanding of proxy management and rotation techniques.
- Hands-on experience with Docker (Dockerfiles, multi-stage builds, debugging containerized apps).
- Solid grasp of Pydantic and Python type hints for data modeling and validation.
- Familiarity with asynchronous programming (asyncio) and concurrent task management for scraping performance.
- Proficient in detecting and utilizing localized APIs exposed through DevTools for efficient data extraction.
- Strong debugging and problem-solving skills when dealing with site structure changes, blocking, or rate limits.
- Basic understanding of MongoDB & NoSQL databases in general.
Nice to Have:
- Experience deploying workloads in Google Cloud Run, Cloud Build, or GKE.
- Experience using Beanie ODM or similar libraries for object-document mapping.
- Understanding of logging, monitoring, and error alerting within containerized environments.
- Prior experience with CI/CD pipelines for automating build and deployment of scrapers.
- Knowledge of headless browser detection evasion and fingerprinting avoidance strategies.
About You:
You are a detail-oriented developer who values clean, maintainable code and takes pride in building resilient automation tools. You understand how to balance efficiency, reliability, and stealth when scraping modern web applications.
If you are confident in your ability to deliver a solid, production-ready scraper ready to run in the cloud, we’d love to hear from you
How to Apply:
- Interested candidates should submit their resume to hr.slickflo@gmail.com.
- Email’s subject must mention “Candidate Name – Web Scraper Developer”.
- Candidates must mention their notice period (if currently employed) and expected salary (PKR) in the email body.
