Software Engineer Intern - Fuzzy Distinct

return to results

Company:

Dataiku

Place:

Details of the offer

Internship goal
Augment Dataiku data preparation by improving features on data records
Detailed description
Today, Dataiku boasts a robust data preparation framework that functions admirably to process a vast amount of data, helping users to have clean databases with the right data (and only the right data) inside them. However, we believe that with your help, we can take it a step further!
In a world where databases can be filled by real humans, data is not always clean. Errors can happen, typos can be made, and sometimes, you want to merge two database tables containing the same information, but not quite in the same format. "Dataiku", "dataiku", "data
iku" refer to the same company, but will be considered different entries in your database.
The goal of this internship is to improve the capabilities of our "distinct" processor to support fuzzy matching (aka: matching data that looks almost the same). You will participate to help our customers clean up their database, detect duplicated information and reduce them to a single line.
Why Engineering at Dataiku?
Dataiku's on-premise, cloud, or SaaS-deployed platform connects many data science technologies, and our technology stack reflects our commitment to quality and innovation. We integrate the best of data and AI tech, selecting tools that truly enhance our product. From the latest LLMs to our dedication to open source communities, you'll work with a dynamic range of technologies and contribute to the collective knowledge of global tech innovators. You can find out even more about working in Engineering at Dataiku by taking a look here.
How you'll make an impact
Get familiarwith Dataiku and its data preparation recipes as well as database schemas.

Participate todesigna new component able to detect duplicate data

Developthe User Interface that helps the user understand the clusters of data

Helpour users to reduce their data overload!

Stack
Python and Java for the backend side

JavaScript/Angular for the frontend part

#LI-Onsite

Source: Greenhouse

Job Function:

Requirements

Software Engineer Intern - Fuzzy Distinct

Company:

Dataiku

Place:

New York, New York

Job Function:

Information Technology

Report this offer

Similar offers

See more similar offers

Product Marketing Lead, Enterprise

Vimeo is looking for a passionate and seasoned enterprise product marketer to join the Product Marketing Team. You'll work at the intersection of product dev...

From Vimeo - New York

Published 6 days ago

Senior Application Security Engineer

Senior Application Security Engineer Alma is seeking a mission-driven Senior Application Security Engineer to join our team. We are dedicated to building sec...

From Alma - New York

Published 4 days ago

Claims Specialist I

What You'll Do:We're looking for an experienced Claims Specialist who is passionate about improving the landscape for mental healthcare. This position will h...

From Grow Therapy - New York

Published 4 days ago

Account Director (French Speaking)

Vimeo is looking for an experienced Account Director to build and maintain relationships with our key clients. In this role, you will be responsible for a la...

From Vimeo - New York

Published 5 days ago

Built at: 2024-05-15T11:35:51.613Z