Data Engineer (Java and Python)

  • Sonatype
  • Colombia
  • 02 Apr, 2024
Full time

Job Description

Sonatype is the software supply chain management company. We're on a mission to change how the world innovates by making software development easier. We run the world's largest repository of Java open-source components (Maven Central), we invented componentized software development and then software supply chain management, and we created the only solution that stops open-source malware in its tracks. We're constantly leading the industry while helping thousands of customers manage open source every day.

Already used by 15 million developers, we have lofty goals for our technology to be in the hands of every engineering team. And we need you to do that. Join us!

Learn more at www.sonatype.com.


We are 350+ employees from diverse backgrounds who hail from more than 50 countries and speak 15 languages. But we all share one thing in common: we’re passionate about accelerating software innovation. Our vision is to put Nexus products at the center of every open source decision made by modern engineering organizations. We’re one of the fastest growing tech companies in America and have been named both a Deloitte Fast 500 and Inc. 5000 company three years in a row. We pride ourselves on being an open and supportive company, which is why we were named to Fast Company’s list of 50 Best Workplaces for Innovators in 2019.

Sonatype’s mission is to enable organizations to better manage their software supply chain. We offer a series of products and services, including the Nexus Repository Manager and Nexus Lifecycle Manager.

We are a talented product development group and we work in small, remote, collaborative teams to create high-quality products. Thousands of organizations and millions of developers use our software. If you have a passion for challenging problems, software craftsmanship and having an impact, then Sonatype is the right place for you.

Overview:

We are seeking a highly motivated and detail-oriented Data Engineer to join our Colombian team. The ideal candidate possesses a strong foundation in data engineering principles and is eager to contribute to the development and maintenance of our data infrastructure. As a Data Engineer, you will work closely with many cross-functional teams to ensure the efficient and effective processing, storage, and retrieval of large volumes of data.

 

What you'll be doing:

  • You'll work in one of our data processing teams to create and manage data pipelines. 
  • You'll work on products that continually ingest, aggregate, and analyze data from open source software components.
  • You'll deliver data from which our customers can derive insights and make decisions about how to use open source software components.
  • You'll monitor and observe our data ingestion pipelines in a production environment. 
  • You’ll collaborate with cross-functional teams to ensure seamless data flow and integration into our data warehouse.
  • You’ll optimize data processing and transformation algorithms to improve efficiency and performance.
  • You’ll apply modern engineering practices such as CI/CD, automated testing, infrastructure as code (IaC), and continuous monitoring.

 

Job Requirements:

  • 3+ years of overall software engineering experience.
  • 3+ years of backend or data engineering experience with at least one programming language commonly used in data engineering (e.g., Python, Java, Scala).
  • Very strong ability to explain technical requirements in English to non-technical business partners.
  • Knowledge of data modeling and database design principles.
  • Excellent problem-solving skills and attention to detail.
  • Ability to work effectively both independently and as part of a team.
  • Strong written and verbal communication skills.
  • A Bachelor's degree in Computer Science, Engineering, or a related field.

 

Preferred Qualifications:

  • Experience with high-volume data ingestion pipelines.
  • Understanding of machine learning concepts and their integration with data pipelines.
  • Previous experience working in a data engineering role or related field.
  • Knowledge and experience with cloud-based data platforms (e.g., AWS, Google Cloud, Azure).
  • Knowledge and experience with large-scale data tools and techniques (e.g., Databricks, Spark, Hadoop, Hive, MapReduce).
  • Knowledge and experience with non-relational databases (e.g., DynamoDB, HBase, MongoDB, Cassandra).
  • Knowledge and experience working with queues and pipelines (e.g., SNS, SQS, RabbitMQ, Kafka).

 

What we offer:

  • The opportunity to be part of an incredible, high-growth company, working on a team of experienced colleagues.
  • Competitive salary package
  • Medical/Dental/Vision benefits
  • Business casual dress
  • Flexible work schedules that ensure time for you to be you.
  • 2019 Best Places to Work (Washington Post and Washingtonian)
  • 2019 Wealthfront Top Career Launch Company
  • EY Entrepreneur of the Year 2019
  • Fast Company Top 50 Companies for Innovators
  • Glassdoor ranking of 4.9
  • Come see why we've won all of these awards

If you are passionate about data engineering and eager to contribute to a dynamic and innovative team, we invite you to apply for the Data Engineer position.

At Sonatype, we value diversity and inclusivity. We offer perks such as parental leave, diversity and inclusion working groups, and flexible working practices to allow our employees to show up as their whole selves. We are an equal-opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. If you have a disability or special need that requires accommodation, please do not hesitate to let us know.

