We are seeking a talented Python Developer with expertise in ETL (Extract, Transform, Load) processes and Apache Airflow. The candidate will be responsible for designing and implementing robust and efficient data pipelines, ensuring high data quality, and contributing to the continuous improvement of our data management practices. This is a remote position supporting a Washington, DC-based team.
Essential Duties and Responsibilities:
- Design, develop, and maintain ETL processes using Python and Apache Airflow.
- Collaborate with data analysts and other stakeholders to understand and meet their data requirements.
- Develop and implement data validation processes to ensure high data quality.
- Troubleshoot and resolve issues related to data pipelines.
- Optimize data extraction, transformation, and loading processes to improve efficiency and performance.
- Document and maintain the design and details of data processes and schemas.
- Stay updated with the latest industry trends and technologies to ensure our data practices remain current.
Required Skills:
- Proficiency in Python: A strong understanding of the Python programming language is crucial. This includes knowledge of Python libraries and frameworks such as Pandas, NumPy, and Django.
- Expertise in Apache Airflow: Experience in designing, building, and maintaining data pipelines using Apache Airflow. Knowledge of Airflow's architecture, including DAGs and Operators, is important.
- ETL Processes: Proficiency in ETL (Extract, Transform, Load) processes, including data extraction from various sources, data transformation (cleaning, validating, aggregating, joining, etc.), and loading data into the end target (database or data warehouse).
- Database Knowledge: Strong understanding of SQL and NoSQL databases, including writing complex queries and database optimization techniques.
- Data Warehousing: Experience with data warehousing solutions like Amazon Redshift, Google BigQuery, or Microsoft Azure SQL Data Warehouse.
- Strong communication and collaboration skills.
- Excellent problem-solving skills.
Desired Skills:
- Knowledge of data modeling and data warehousing.
Education and Certifications:
- Bachelor's degree in Computer Science, Information Systems, or a related field.
Application deadline: 10-25-2024
The proposed salary range for this position is $96,800 to $145,200 USD.
SMX is committed to hiring and retaining a diverse workforce. All qualified candidates will receive consideration for employment without regard to disability status, protected veteran status, race, color, age, religion, national origin, citizenship, marital status, sex, sexual orientation, gender identity or expression, pregnancy or genetic information. SMX is an Equal Opportunity/Affirmative Action employer including disability and veterans.
The selected applicant will be subject to a background investigation.