Ilia Zlobin

Systems Architect (Cloud), SRE/DevOps, AI/ML Engineer, ML/LLMOps, Researcher

Picture of Ilia Zlobin

Professional Career

An accomplished industry professional with over a decade of experience spanning Cloud Architecture, DevOps, Site Reliability Engineering (SRE), Software Development Engineering (SDE), and AI and MLOps Engineer.

Demonstrated expertise in delivering exceptional results for a diverse clientele of over 30 organizations, leveraging a blend of technical acumen, strategic thinking, and effective leadership.

Continuous pursuit of knowledge and refinement of methodologies has always ensured the successful execution of projects of various complexity. This involves staying up-to-date with the latest technological advancements, regularly attending professional conferences, connecting and communicating with industry professionals, and engaging in hands-on experimentation with emerging tools and techniques. By integrating new insights and best practices into his workflow, he is able to streamline processes, enhance efficiency, and mitigate potential risks, thereby guaranteeing the timely and successful delivery of projects of various types.

Earned certifications from leading industry providers including Amazon Web Services (AWS), Google Cloud Platform (GCP), Microsoft Azure, and Cloud Native Computing Foundation (CNCF), validating proficiency in cloud technologies and best practices.

Actively educates himself about advancments of Machine Learning (ML) and Artificial Intelligence (AI) through rigorous research and development work while reporting about the findings through the series of blog posts and professional YouTube videos

AI/ML Research Projects

  • Conducted in-depth investigations into Fine-Tuned Transformers and Synthetic Machine Learning (SML) models such as Llama, Gemma, Phi, and Mixtral for Grammar correction, documenting findings and methodologies.
  • Explored the practical applications of DSPy framework, encompassing research, design, architecture, and demonstration of its capabilities in diverse contexts.
  • Evaluated Visual Language Models (VLM) for image and video analysis, experimenting with different methodologies and presenting insights derived from empirical observations.
  • Engaged in ongoing studies focused on diffusion models, while also exploring Transfer Learning (TRL), Reinforcement Learning with Human Feedback (RLHF), and efficient model serving strategies such as TGI, Triton, vLLM, Kubeflow, MLFlow, and Kubernetes (K8S)/Ray.
  • Continuously monitors and reports on the latest developments in the fields of Machine Learning, Artificial Intelligence, and Large Language Models (LLMs), contributing to the dissemination of knowledge within the professional community.

Full-Stack Development Projects

  • Developed and deployed NextJS applications as serverless solutions on Lambda Edge using customized Serverless Stack (SST), sharing insights and tutorials on architecture and implementation.
  • Created cloud-based blogs summarization solutions using AWS Step Functions and Serverless Stack (SST), optimizing content management processes and enhancing accessibility.
  • Implemented an innovative Notion to NextJS Blog conversion API with custom authorization mechanisms, facilitating seamless migration and integration of content management systems.

Core Skills and Competency Areas

  • Expertise in Cloud Architecture, including AWS, GCP, and Azure environments.
  • Proficient in DevOps and Site Reliability Engineering (SRE) methodologies.
  • Specialized knowledge in Machine Learning Operations (MLOps) and Large Language Model Operations (LLMOps).
  • Advanced skills in Research and Development (R&D) within the fields of Machine Learning (ML) and Artificial Intelligence (AI).
  • Strong capabilities in System Design, Optimization, and Technical Leadership.
  • Extensive experience in Continuous Integration and Continuous Deployment (CI/CD) pipelines.
  • In-depth understanding of Microservices Architecture and Kubernetes (EKS, K8S) orchestration.
  • Proficiency in Infrastructure as Code (IaC) frameworks and practices.