Recruitment.bg is a boutique IT recruitment company, based in Bulgaria. We aim to work with the top employers in the industry, companies that we thoroughly vet and trust. Our mission is to guide IT professionals toward improved career paths by understanding their skills, crafting employment strategies, and supporting them every step of the way. Placing emphasis on honesty, respect and reliability while delivering exceptional service by ‘going the extra mile’ we build long term relationships with the people and organizations we work with.
About the Client
Fast-growing iGaming product company with a platform used by players in 50+ countries. They build high-scale, real-time systems for online casino and sports betting with modern microservices architecture, cloud-native infrastructure, and active ML implementation for data analysis and responsible gaming.
The Role
Architect, develop, and optimize generative AI infrastructure across the full ML lifecycle—from model development to production deployment.
Key Responsibilities:
Design scalable systems for training and serving LLMs
Develop low-latency inference pipelines and robust APIs
Implement advanced NLP: few-shot learning, prompt engineering, RAG
Fine-tune pre-trained models for specific use cases
Optimize model inference and distributed training architectures
Requirements
Must-Have:
5+ years in ML with focus on NLP and deep learning
Strong Python + PyTorch/TensorFlow + Hugging Face Transformers
Experience with ML ops tools, model optimization, and distributed training
Solid software engineering practices (CI/CD, containerization)
Nice-to-Have:
Transformer architectures & RLHF experience
Cloud platforms (AWS/GCP/Azure) with ML services
Open-source contributions or ML research publications