isparrow services Pvt ltd
Location
Ahmedabad | India
Job description
Overview:
The Python Web Scraper plays a crucial role in gathering and organizing data from various websites efficiently. Their primary responsibility is to develop and maintain web scraping tools and scripts to collect specific information for analysis and business purposes.
Key Responsibilities:
- Collaborate with stakeholders to identify data sources and requirements.
- Write efficient and scalable Python scripts to extract data from target websites.
- Implement and maintain web scraping pipelines and workflows.
- Develop and optimize data extraction techniques using tools like BeautifulSoup and Scrapy.
- Handle and resolve issues related to website changes or antiscraping measures.
- Perform data validation and cleansing to ensure accuracy and consistency.
- Automate data retrieval processes and monitor for quality assurance.
- Collaborate with data analysts and engineers to integrate scraped data into systems.
- Stay updated with web scraping best practices and evolving technologies.
- Document processes tools and best practices for knowledge sharing.
Required Qualifications:
- Bachelors or Masters degree in Computer Science Information Technology or related field.
- Proven experience in web scraping and data extraction using Python.
- Proficiency in HTML/CSS and XPath for data parsing and manipulation.
- Strong knowledge of data structures algorithms and web protocols.
- Experience with web scraping frameworks like BeautifulSoup Scrapy or Selenium.
- Ability to handle and troubleshoot issues related to website changes and antiscraping measures.
- Experience in working with databases and data storage technologies.
- Familiarity with version control systems like Git for code management.
- Strong problemsolving skills and attention to detail in data validation and cleansing.
- Excellent communication and collaboration skills for working in crossfunctional teams.
python,web scraping,data extraction,html/css,xpath
Job tags
Salary