Lead Data Scientist job opportunity at Smarsh.



bot
Smarsh Lead Data Scientist
Experience: 2-years
Pattern: remote
apply Apply Now
Salary:
Status:

Data science

Copy Link Report
degreePhD
loacation Atlanta / New York, United States Of America
loacation Atlanta / New ..........United States Of America

As a Lead #Data #Scientist (NLP & #Financial #Compliance) at #Smarsh, you will spearhead the development of state-of-the-art natural language processing (NLP) and large language model (LLM) solutions that power next-generation compliance and surveillance systems. You’ll work on highly specialized problems at the intersection of natural language processing, communications intelligence, financial supervision, and regulatory compliance, where unstructured data from emails, chats, voice transcripts, and trade communications hold the keys to uncovering misconduct and risk. The role will involve working with other Senior Data Scientists and mentoring Associate Data Scientists in analyzing complex data, generating insights, and creating solutions as needed across a variety of tools and platforms. This role demands both technical excellence in NLP modeling and a deep understanding of financial domain behavior—including insider trading, market manipulation, off-channel communications, MNPI, bribery, and other supervisory risk areas. The ideal candidate for this position will possess the ability to perform both independent and team-based research and generate insights from large data sets with a hands-on/can do attitude of servicing/managing day to day data requests and analysis. This role also offers a unique opportunity to get exposure to many problems and solutions associated with taking machine learning and analytics research to production. On any given day, you will have the opportunity to interface with business leaders, machine learning researchers, data engineers, platform engineers, data scientists and many more, enabling you to level up in true end-to-end data science proficiency. __ How will you contribute? Collect, analyze, and interpret small/large datasets to uncover meaningful insights to support the development of statistical methods / machine learning algorithms. Lead the design, training, and deployment of NLP and transformer-based models for financial surveillance and supervisory use cases (e.g., misconduct detection, market abuse, trade manipulation, insider communication). Development of machine learning models and other analytics following established workflows, while also looking for optimization and improvement opportunities Data annotation and quality review Exploratory data analysis and model fail state analysis Contribute to model governance, documentation, and explainability frameworks aligned with internal and regulatory AI standards. Client/prospect guidance in machine learning model and analytic fine-tuning/development processes Provide guidance to junior team members on model development and EDA Work with Product Manager(s) to intake project/product requirements and translate these to technical tasks within the team’s tooling, technique and procedures Continued self-led personal development __ What will you bring? Strong understanding of financial markets, compliance, surveillance, supervision, or regulatory technology Experience with one or more data science and machine/deep learning frameworks and tooling, including scikit-learn, H2O, keras, pytorch, tensorflow, pandas, numpy, carot, tidyverse Command of data science and statistics principles (regression, Bayes, time series, clustering, P/R, AUROC, exploratory data analysis etc…) Strong knowledge of key programming concepts (e.g. split-apply-combine, data structures, object-oriented programming) Solid statistics knowledge (hypothesis testing, ANOVA, chi-square tests, etc…) Knowledge of NLP transfer learning, including word embedding models (gloVe, fastText, word2vec) and transformer models (Bert, SBert, HuggingFace, and GPT-x etc.) Experience with natural language processing toolkits like NLTK, spaCy, Nvidia NeMo Knowledge of microservices architecture and continuous delivery concepts in machine learning and related technologies such as helm, Docker and Kubernetes Familiarity with Deep Learning techniques for NLP. Familiarity with LLMs - using ollama & Langchain Excellent verbal and written skills Proven collaborator, thriving on teamwork __ Preferred Qualifications Master’s or Doctor of Philosophy degree in Computer Science, Applied Math, Statistics, or a scientific field Familiarity with cloud computing platforms (AWS, GCS, Azure) Experience with automated supervision/surveillance/compliance tools

Other Ai Matches

remote-jobserver Remote
Integration Architect Applicants are expected to have a solid experience in handling Divisions – Shared Services related tasks
Manager, Corporate Sales Applicants are expected to have a solid experience in handling Divisions – Sales Front Line related tasks
Senior Product Manager – Intelligence Catalog Applicants are expected to have a solid experience in handling Divisions – Product Management related tasks
remote-jobserver Remote
Solutions Engineer Applicants are expected to have a solid experience in handling Divisions – Sales Support related tasks
Software Engineer III Applicants are expected to have a solid experience in handling Divisions – Enterprise Engineering related tasks
remote-jobserver Remote
Salesforce Functional Architect Applicants are expected to have a solid experience in handling Divisions – Shared Services related tasks
Accounts Payable Lead Applicants are expected to have a solid experience in handling Divisions – Finance related tasks
Sr. Platform Engineer II Applicants are expected to have a solid experience in handling Divisions – Enterprise Engineering related tasks
Sr. Manager, Corporate Sales Applicants are expected to have a solid experience in handling Divisions – Sales Front Line related tasks
Digital Support Program Manager Applicants are expected to have a solid experience in handling Divisions – Customer Support related tasks
Account Executive III, Corporate Accounts (New Logo) Applicants are expected to have a solid experience in handling Divisions – Sales Front Line related tasks
remote-jobserver Remote
Sr. Channel Account Manager (Voice Compliance & SaaS Solutions) Applicants are expected to have a solid experience in handling Divisions – Sales Front Line related tasks
Platform Engineer III - Keycloak Applicants are expected to have a solid experience in handling Divisions – Enterprise Engineering related tasks
remote-jobserver Remote
Sr. Solutions Engineer Applicants are expected to have a solid experience in handling Divisions – Sales Support related tasks
Account Executive II, Corporate Accounts (New Logo) Applicants are expected to have a solid experience in handling Divisions – Sales Front Line related tasks
remote-jobserver Remote
Account Executive I, Corporate Accounts Applicants are expected to have a solid experience in handling Divisions – Sales Front Line related tasks
Sr. Salesforce Administrator Applicants are expected to have a solid experience in handling Divisions – Shared Services related tasks
remote-jobserver Remote
Lead Data Scientist Applicants are expected to have a solid experience in handling Divisions – Applied Machine Learning related tasks
Lead Data Scientist Applicants are expected to have a solid experience in handling Data science related tasks
Sr. Principal, Product Management Applicants are expected to have a solid experience in handling Divisions – Product Management related tasks
remote-jobserver Remote
Account Executive, Federal Sales Applicants are expected to have a solid experience in handling Divisions – Sales Front Line related tasks
Senior Java Developer Applicants are expected to have a solid experience in handling Divisions – Capture Engineering related tasks
remote-jobserver Remote
Sales Development Representative I (Public Sector) Applicants are expected to have a solid experience in handling Divisions – Sales Support related tasks