
Machine Learning Engineer - Data Pipeline (Dublin, CA)

About us: 

At Articul8 AI, we relentlessly pursue excellence and create exceptional AI products that exceed customer expectations. We are a team of dedicated individuals who take pride in our work and strive for greatness in every aspect of our business. We believe in using our advantages to make a positive impact on the world and inspiring others to do the same. 

Job Description: 

We are seeking machine learning engineers to join our team full-time. In this role, you will help us build pipelines for data collection, data extraction, data filtering and synthetic data generation, and data analysis. You will own, end to end, all work related to acquiring the high-quality data that powers the training of our domain-specific models. You will work closely with other researchers and engineers to empower our next generation of domain-specific models. We value rapid prototyping, iterating, and shipping new systems quickly.

Responsibilities:

  • Design and develop data processing pipelines, including data extraction, data filtering, data labeling, etc.

  • Implement machine learning models to improve the quality and diversity of data (especially in the data extraction stage), e.g., quality classifiers, document layout models, and code verification models.

  • Own and lead engineering projects in the area of data acquisition, including web crawling, data ingestion, and processing.

  • Collaborate with our Applied Research, Technology, and Architecture teams to ensure smooth data flow and system operability.

  • Develop and deploy highly scalable distributed systems capable of handling terabytes of data.

  • Architect and implement algorithms for data indexing and search capabilities.

  • Build and maintain backend services for data storage, including work with key-value databases and synchronization.

  • Deploy solutions in a Kubernetes Infrastructure-as-Code environment and perform routine system checks.

Required Qualifications: 

  • BS/MS/PhD in Computer Science or a related field. 

  • Proficiency in at least one deep learning framework, such as PyTorch. 

  • Experience in machine learning projects in text or vision, e.g., having trained machine learning models to tackle a specific problem.

  • Strong expertise in large stateful distributed systems and data processing. 

  • Strong proficiency in building large-scale data processing pipelines, and familiarity with distributed workloads (e.g., multiprocessing, Ray, Docker, Kubernetes).

  • Proficiency in at least one programming language commonly used in machine learning, such as Python, and the ability to write clean, maintainable code.

  • Excellent problem-solving skills and attention to detail, especially when handling data anomalies and biases to further improve data quality. 

Preferred Qualifications: 

  • Active GitHub contributions are a big plus.

  • Experience in building large-scale datasets. 

  • Familiarity with at least one tool for data crawling (e.g., Scrapy), data collection (e.g., VPNs, Selenium), or data processing (e.g., Hadoop, Datasketch).

  • Building bespoke data processing libraries from scratch. 

  • Keeping up with state-of-the-art techniques for preparing AI training data. 

  • Organizing and meticulously bookkeeping data across multiple clouds, of multiple modalities, and from many sources. 

  • Multilingual ability, which contributes to the language diversity crucial for robust model training.

What We Offer:

By joining our team, you become part of a community that embraces diversity, inclusiveness, and lifelong learning. We nurture curiosity and creativity, encouraging exploration beyond conventional wisdom. Through mentorship, knowledge exchange, and constructive feedback, we cultivate an environment that supports both personal and professional development.

If you're ready to join a team that's changing the game, apply now to become a part of the Articul8 team. Join us on this adventure and help shape the future of Generative AI in the enterprise.

Average salary estimate: $130,000 per year (est.), range $100,000 to $160,000


Employment type: Full-time
Date posted: June 23, 2025
