Skip to main content
Posted August 26, 2022
Metrika

Senior Data Engineer

Remote job Remote Full Time

It’s an exciting time to join Metrika! A Series A-funded startup in growth-mode with teammates across the US, Canada, UK, and Europe, we are building...

It’s an exciting time to join Metrika! A Series A-funded startup in growth-mode with teammates across the US, Canada, UK, and Europe, we are building the world's premier operational intelligence platform for blockchain networks. Metrika partners with blockchain protocols, foundations, and node runners to help them and their community members analyze individual and network-wide metrics of their Distributed Ledger Technology (DLT) networks to maintain and improve their performance, security, and reliability.

These are the early days of our platform, and as a Senior Data Engineer you will be able to contribute, influence, and take ownership in significant parts of our systems. Our goal is to build a very high performance platform, capable of analyzing thousands of transactions across multiple blockchain networks in real-time.

If you are a Senior Data Engineer, with a solid understanding of data lakes, data warehouses, ETL, distributed systems, passion for your work and would love to work with a geographically distributed team, in an emerging industry join us! No prior experience in blockchain necessary, but an interest in learning and being deeply immersed is.

What you'll be doing:

  1. Designing, implementing and maintaining data processing pipelines — this includes ingestion, clean up, transformation, aggregation, batch and streaming jobs, as well managing the data lifecycle to ensure affordable and performant long-term storage across our data stores and data lake. You will work closely with our software engineers, SREs and our Analytics team to make sure data smoothly flows across Metrika and beyond to our customers and users.

  2. Working under a Scrum or Kanban framework.

  3. Owning your work. This means being proud of your work, actively striving for excellence, observing the best practices of your craft and always aiming to improve your skill.

  4. Understanding, participating and contributing to the company goals, regardless of your role. Metrika is a small company with a very inclusive culture. We are looking for people that share those values with us.


Please note: Our Engineering team is predominantly based in Europe and the eastern United States. This position is currently open to those resident and currently able to work in the European Economic Area (EU, Norway, Liechtenstein), Switzerland, the UK as well the eastern United States/Canada (UTC-4/UTC-5 timezone).


Metrika Inc. is an Equal Opportunity employer. All applicants will be considered without regard for race, color, national origin, ethnicity, gender, disability, sexual orientation, gender identity, or religion.


We are looking for individuals with:

  1. A Bachelor's degree in Computer Science, Electrical Engineering, Physics or Mathematics. Masters or higher degrees preferred.

  2. Multi-year experience in data engineering, in large-scale production environments.

  3. At Metrika we mostly use Python for data processing; most of our ETL/Data processing jobs are written in Python. You will need to have some familiarity with scheduling systems (e.g. Airflow, luigi etc.), data transformation tools (e.g. dbt), distributed compute frameworks (e.g. Apache Spark, Apache Flink, ray.io etc.), and a solid understanding of the concepts of data governance, data lineage/provenance.

  4. Excellent understanding of TDD, agile development methodology and version control.

  5. The ability to function autonomously to solve problems, and deliver working software. Our remote environment and geographic distribution requires people that can work well on their own.

  6. The ability to communicate well with your team, both interactively and asynchronously, and that of being a positive, constructive team member.


You'll be a great fit if you have:

  1. Worked and contributed to a Big Data production environment, handling multiple GB of data per day.

  2. Good knowledge of Python.

  3. Experience with Apache Spark, Apache Flink, Ray.io and Airflow,

  4. Experience with using and building CI/CD pipelines

  5. Experience with Docker/Kubernetes or Serverless environments.

  6. Experience with SQS/SNS, Apache Kafka, RabbitMQ or other brokers.

  7. Experience with public cloud providers, e.g. AWS, GCP, Azure, DigitalOcean etc.

  8. Experience with blockchain systems.

Once you submit your application, you will receive an automated email from the recruitee.com domain within a few minutes acknowledging we have received your application. If you do not receive this email within a few minutes, please check your spam folder or other filtered folders. And to ensure our future communications reach you, please add emails from the recruitee.com domain to your safe list.

This listing expired on Sep 06. Applications are no longer accepted.

Below are some other jobs we think you might be interested in.