Middle Data Scientist (LLM)
WaveAccess is looking for a Data Scientist to join our team and contribute to innovative projects in the pharmaceutical domain. This role involves working with real-world pharmaceutical data and leveraging the power of Large Language Models (LLMs) to drive impactful insights and solutions.
Responsibilities:
- LLM Integration: Develop, fine-tune, and implement Large Language Models to analyze and process diverse sets of text and medical data
- Data Analysis: Perform advanced data analysis on real-world pharmaceutical datasets to extract meaningful insights and support decision-making processes
- Text Mining and NLP: Utilize natural language processing techniques to extract relevant information from large volumes of text, including medical literature, patient records, and clinical trial data
- Model Development: Build and validate predictive models to address key challenges in the pharmaceutical industry, such as drug efficacy, patient outcomes, and adverse event prediction
- Innovation: Stay up-to-date with the latest advancements in LLMs and NLP, and apply innovative approaches to solve complex problems in the pharmaceutical field
Requirements:
- At least 3 years of experience in a Data Scientist position
- English - B2
- Deep knowledge of Neural Networks and architectures for working with sequences, in particular (RNN, LSTM, Transformers, CNN, attention)
- Experience with Large Language Models (LLMs) and their application. Familiarity with modern LLM techniques such as Retrieval-Augmented Generation (RAG) and LLM agents
- Solid Python skills
- Experience in presenting achieved results
Technologies:
- Python
- Transformers
- LLM
- Standard NLP stack
- Standard ML stack
- Basic SQL
- Git
- Vector databases(Postgres+pgvector / Milvus/ Qdrant/ Faiss)
Preferred:
- Knowledge of general Machine Learning approaches
- Knowledge of mathematical statistics
- Experience with AWS (EC2, S3)
- Linux + bash, ssh
- Experience in written and verbal communication with business stakeholders
- Experience with full development cycle
Nice to have:
- RestAPI development experience
- Snowflake
- Docker
- Understanding of CI/CD
- Java/C++/Other languages
Обслуживать клиентов: работать со счетами, пластиковыми картами и денежными переводами. Продавать банковские и страховые продукты (кредитные продукты, карты, вклады).
Обслуживать клиентов: работать со счетами, пластиковыми картами и денежными переводами. Продавать банковские и страховые продукты (кредитные продукты, карты, вклады).