Data Engineer/Scientist for ML job opportunity at Samsung.



Date posted: 13 days ago
Samsung Data Engineer/Scientist for ML
Experience: General
Pattern: full-time

Degree: General
Location: 24A, Kifissias Avenue, Athens, Greece

Position Summary

We are seeking a specialized Data Engineer or Data Scientist to manage the complete lifecycle of the training data that powers our AI models. This role is pivotal in curating, sanitizing, and structuring high-quality speech and text datasets, serving as the foundation for training state-of-the-art Automatic Speech Recognition (ASR), Text-to-Speech (TTS), and Machine Translation (MT) systems.

Role and Responsibilities

Data Pipeline Architecture: Design, build, and maintain robust pipelines for the ingestion, processing, and management of heterogeneous data sources, ensuring efficient flow from raw collection to model-ready inputs.

Unstructured Data Extraction: Extract and process high-fidelity speech data from complex, unstructured sources, including video feeds, multi-channel audio recordings, and raw text archives.

Corpus Curation & Management: Organize, structure, and analyze complex linguistic datasets, including speech-to-text alignments and parallel translation corpora, ensuring metadata accuracy and consistency.

Data Cleaning & Noise Reduction: Implement rigorous quality control protocols to identify and correct errors, remove artifacts, and apply noise reduction techniques to enhance audio clarity.

Dataset Enhancement Strategies: Develop and execute strategies to improve data quantity and diversity, including the application of data augmentation techniques and synthetic data generation.

Cross-Functional Collaboration: Partner closely with Machine Learning Engineers to align data preprocessing workflows and formatting with the specific requirements of various model architectures.

Skills and Qualifications

Programming Proficiency: Advanced proficiency in Python and core data manipulation libraries (e.g., Pandas, NumPy) with the ability to write clean, efficient, and scalable code.

Audio & Data Tooling: Hands-on experience with audio processing and analysis tools (e.g., librosa, torchaudio, Praat) and database management systems (SQL/NoSQL).

ML & NLP Fundamentals: Solid understanding of Machine Learning principles and the specific preprocessing and tokenization requirements for Natural Language Processing (NLP) and speech tasks.

Data Quality Expertise: Proven track record in handling large-scale, messy, or unstructured datasets, with a strong focus on data validation, cleaning, and sanitization techniques.

* Please visit Samsung membership to see the Privacy Policy, which defaults according to your location, at: https://account.samsung.com/membership/policy/privacy. You can change Country/Language at the bottom of the page. If you are a European Economic Area resident, please see: https://europe-samsung.com/ghrp/PrivacyNoticeforEU.html
