We are looking for a Data Engineer to join our team, which works with clients in Brazil, Europe and the United States. In this position, you will design, implement and support data pipelines, working closely with teams in the EU, the US, Brazil and other countries. You will have the opportunity to improve your English by communicating with these teams and clients throughout the workday.
We are looking for someone who has:
- Fluent English, both written and spoken;
- Solid experience in designing, building and testing data pipelines;
- Solid experience using Infrastructure as Code in Python for data work (collection, cleaning, transformation), including reading and writing data frames in several formats;
- Comfort learning new frameworks;
- Experience with the Azure cloud;
- Willingness to work with different technologies and learn every day;
- Ability to share your knowledge with more junior team members.
The main responsibilities include, but are not limited to:
- Develop code used by data tools in the cloud (Azure) to perform the expected data aggregation and processing operations.
- Support, troubleshoot and fix bugs in existing data pipelines.
- Define, build and deliver high-quality data pipelines.
- Participate in the Data Validation process.
- Design, construct, install, test and maintain data management systems.
- Ensure that all systems meet business requirements as well as industry practices.
- Integrate up-and-coming data management and software engineering technologies into existing data structures.
- Develop set processes for data mining, data modeling, and data production.
- Recommend different ways to constantly improve data reliability and quality.
- Analyze and organize raw data.
- Build data marts that will be consumed by data scientists' analyses and data visualization tools.
We expect candidates to have knowledge of some, but not necessarily all, of the fields below:
- Experience with Power BI;
- Knowledge of non-functional requirements (performance, security, privacy, GDPR, etc.);
- Knowledge of unit, integration and E2E testing;
- Cloud certification (Azure or AWS);
- Data streaming (Kafka/Event Hubs, Spark/Stream Analytics).