| Team | Location |
| --- | --- |
| Engineering | Remote |
What You Will Be Doing:
- Web Scraping and Data Extraction:
  - Run existing data-scraping scripts.
  - Extract data from websites and APIs using Python libraries such as Requests, Selenium, and Beautiful Soup.
- Data Transformation and Cleaning:
  - Transform, clean, and standardize data so that it is consistent and high quality.
  - Use Python, Pandas, and Excel for data manipulation and analysis.
- Data Quality Checks:
  - Design manual checks that verify data accuracy and completeness.
  - Run manual quality checks to identify and correct inconsistencies and inaccuracies.
  - Use Excel's nested text functions and logical operations to find and correct data issues.
- Data Enrichment:
  - Conduct secondary research to fill in and extend data fields.
  - Enhance datasets by integrating supplementary data sources.
- Multiple File Handling and Management:
  - Manage multiple Excel and CSV files efficiently.
  - Merge, split, and clean data files from multiple sources.
  - Maintain an organized file structure for easy access and retrieval.
Experience & Requirements:
- More than one year of experience in data engineering or data analysis.
- Proficiency in Python and experience with web scraping libraries such as Requests, Selenium, and Beautiful Soup.
- Strong knowledge of data manipulation and analysis using Pandas and Excel.
- Hands-on experience with Excel, including the use of nested text functions and logical operations.
- Proficiency in handling and managing multiple Excel and CSV files.
- Proficiency in SQL for data querying and manipulation.
Required Skills & Mindsets:
- Programming Languages: Python
- Libraries: Pandas, Requests, Selenium, Beautiful Soup
- Tools: Excel, SQL