Lead Data Scientist LLM
WaveAccess is looking for a Data Scientist to join our team and contribute to innovative projects in the pharmaceutical domain. This role involves working with real-world pharmaceutical data and leveraging the power of Large Language Models (LLMs) to drive impactful insights and solutions.
Responsibilities:
- LLM Integration: Develop, fine-tune, and implement Large Language Models to analyze and process diverse sets of text and medical data
- Data Analysis: Perform advanced data analysis on real-world pharmaceutical datasets to extract meaningful insights and support decision-making processes
- Text Mining and NLP: Utilize natural language processing techniques to extract relevant information from large volumes of text, including medical literature, patient records, and clinical trial data
- Model Development: Build and validate predictive models to address key challenges in the pharmaceutical industry, such as drug efficacy, patient outcomes, and adverse event prediction
- Innovation: Stay up-to-date with the latest advancements in LLMs and NLP, and apply innovative approaches to solve complex problems in the pharmaceutical field
- Team Leadership: Manage and mentor a team of developers to ensure effective collaboration and high-quality deliverables, fostering a culture of innovation and continuous improvement while aligning project goals with organizational objectives.
Requirements:
- At least 5 years of experience in a Data Scientist position
- English - B2
- Deep knowledge of Neural Networks and architectures for working with sequences, in particular (RNN, LSTM, Transformers, CNN, attention)
- Experience with Large Language Models (LLMs) and their application. Familiarity with modern LLM techniques such as Retrieval-Augmented Generation (RAG) and LLM agents
- Solid Python skills
- Experience in presenting achieved results
- Experience in managing a team
Technologies:
- Python
- Transformers
- LLM+ RAG
- Standard NLP stack
- Standard ML stack
- Basic SQL
- Git
- Vector databases(Postgres+pgvector / Milvus/ Qdrant/ Faiss)
Preferred:
- Knowledge of general Machine Learning approaches
- Knowledge of mathematical statistics
- Experience with AWS (EC2, S3)
- Linux + bash, ssh
- Experience in written and verbal communication with business stakeholders
- Experience with full development cycle
Nice to have:
- RestAPI development experience
- Snowflake
- Docker
- Understanding of CI/CD
- Java/C++/Other languages
We offer the following conditions:
- Work in a dynamic international team
- Opportunity for collaboration through individual entrepreneurship/self-employment for colleagues outside of Russia
- Participation in foreign and Russian projects
- Health insurance with dental coverage
- Necessary equipment for work
- Corporate training programs
- Wide opportunities for self-realization, professional and career growt
- Democratic approach to processes and flexible start of the workday
- Option to relocate between our overseas offices.
Обслуживать клиентов: работать со счетами, пластиковыми картами и денежными переводами. Продавать банковские и страховые продукты (кредитные продукты, карты, вклады).
Обслуживать клиентов: работать со счетами, пластиковыми картами и денежными переводами. Продавать банковские и страховые продукты (кредитные продукты, карты, вклады).