B

o

n

j

o

u

r

;

 

I

'

m

S

u

g

u

m

a

r

a

n

B

a

l

a

s

u

b

r

a

m

a

n

i

y

a

n

I

b

u

i

l

d

p

r

o

d

u

c

t

s

w

i

t

h

d

a

t

a

!

Data professional with 5+ years of experience in Data Analysis, Data Science, and Data Engineering. I am skilled in leveraging cutting-edge technologies such as Generative AI, Large Language Models (LLMs), MLOps, and cloud-native architectures to drive business growth. I am proficient in Python, SQL, Dataiku, Power BI, and cloud platforms (AWS, Azure, GCP).

I have a strong ability to deliver data-driven solutions tailored for business intelligence, automation, and decision-making.

</AboutMe>

Hello! I'm Sugumaran Balasubramaniyan, a passionate data enthusiast with a proven track record in transforming raw data into actionable insights. My expertise lies in bridging the gap between complex data systems and strategic business decisions.


With a strong foundation in Python, SQL, and cloud platforms like AWS, Azure, and GCP, I specialize in building scalable data pipelines, predictive models, and interactive dashboards. My goal is to empower organizations with data-driven solutions that enhance efficiency and drive innovation.


Sugumaran Balasubramaniyan

</Experience>

Data scientist
Freelance
Paris, Île-de-France, France | Jan 2024 - Present

• Developed an OCR pipeline using deep learning in Python (Tesseract, PyTorch), reducing manual data entry by 40%.
• Designed a supply chain optimization web app prototype for a pharmaceutical startup, cutting stockouts by 25%.
• Created a generative AI chatbot for a retail client, enhancing customer engagement by 30% and reducing support costs by 15%.
• Implemented a recommendation system for an e-commerce platform, increasing upsell opportunities by 20%.
• Developed a predictive maintenance model for a manufacturing client, reducing downtime by 15% and maintenance costs by 10%.

Data Analyst
Les Compagnons de Paris
Paris, Île-de-France, France | Jul 2023 - Dec 2023

• Built real-time KPI dashboards using Power BI/DAX, improving data accessibility by 27% for executive decision-making.
• Optimized website conversion rates by 20% through A/B testing and customer journey analysis.
• Designed advanced customer segmentation models (Python, Scikit-Learn), boosting qualified leads by 50%.

Data Analyst Associate
Capgemini Technology Services
Chennai, India | Apr 2021 - Jul 2022

• Automated ETL pipelines using Apache Airflow and AWS Glue, reducing processing time by 45% and errors by 63%.
• Engineered inventory forecasting models (SARIMA, LSTM) to cut costs by 21% (€270K/year).
• Improved on-time delivery rates from 72% to 95% using Six Sigma methodologies.

Data analyst
Infosys
Chennai, India | Mar 2020 - Apr 2021

• Created automated reports (Power BI, Excel VBA), saving 8 hours/week in manual effort.
• Mentored 2 junior analysts in Python/SQL, increasing team productivity by 25% and reducing delivery time by 15%.

Analyst
Arimalytics
Pondicherry, India | Jun 2018 - Feb 2020

• Enhanced forecast accuracy by 12% using advanced predictive modeling and regression analysis.
• Developed dashboards to track KPI metrics, enabling clients to monitor progress and make data-driven decisions.

</Education>

MSc Data Science and AI Strategy
emlyon business school
Paris, Île-de-France, France | Aug 2022 - Feb 2024

• This program uniquely bridged the gap between AI technology and business strategy, equipping me with the skills to design and deploy AI applications with a focus on responsible data governance and transparent practices. I gained a practical, action-oriented understanding of both the technical fundamentals and the human/business impacts of AI.

International Exchange Program
McGill University
Montréal, Québec, Canada | May 2023 - Jul 2023

• This program provided me with a comprehensive skillset in emerging technologies, including: developing and deploying Internet of Things (IoT) solutions, understanding North American business practices with a focus on Montreal's tech ecosystem, and designing and implementing advanced recommender systems. Through hands-on projects and theoretical study, I gained expertise in data analysis, hardware/software integration, and collaborative filtering techniques.

Post Graduate Program in Data Science
Great Learning
Chennai, India | July 2018 - Mar 2019

• This intensive program immersed me in key data science and analytics disciplines, including data analysis, machine learning (supervised and unsupervised), and text mining. I developed proficiency in essential tools and technologies like Python, R, Tableau, and database management, applying these skills through real-world industry case studies.

Master of Business Administration
Pondicherry University
Pondicherry, India | July 2016 - May 2018

• This specialized program offered in-depth training in key areas of Operations and Human Resources. I developed proficiency in Supply Chain Management, Operations Research, Service Operations Management, and Quality Management, alongside expertise in HR Analytics, Strategic Human Resource Management, and Human Resources Management. I also gained valuable knowledge in Strategic Management and Project Management.

Bachelor of Technology
Pondicherry University
Pondicherry, India | July 2012 - May 2016

• This four-year program equipped me with a deep understanding of mechanical engineering principles, covering a wide range of subjects including heat and mass transfer, kinematics, and automobile engineering. I developed proficiency in both theoretical concepts and practical applications, preparing me for roles in design, simulation, and control across various industries.

</Certifications>

AWS Machine Learning Engineer Associate
Amazon Web Services

• Proficient in designing, implementing, and deploying machine learning solutions on AWS.
• Expertise in SageMaker, feature engineering, and model optimization.
• Skilled in ML pipeline orchestration and automated model training workflows.

AWS Cloud Practitioner
Amazon Web Services

• Validated expertise in AWS cloud architecture and foundational services.
• Demonstrated knowledge of AWS pricing models and cost optimization strategies.
• Proficient in deploying scalable and secure cloud infrastructure on AWS.

Databricks AI Agents
Databricks Academy

• Mastered building autonomous AI agents using Databricks platform.
• Implemented LLM-based agents for complex task automation and reasoning.
• Expertise in prompt engineering and agentic workflow orchestration.

Snowflake Data Warehousing
Snowflake University

• Proficient in designing and managing cloud-based data warehouses using Snowflake.
• Expertise in data modeling, query optimization, and Snowflake governance.
• Skilled in data sharing and Snowflake collaboration features.

AWS GenAI Practitioner
Amazon Web Services

• Expert in building generative AI applications using AWS services.
• Proficient with Amazon Bedrock, SageMaker JumpStart, and generative AI tools.
• Skilled in prompt engineering and responsible AI practices.

Dataiku ML Practitioner
Dataiku Academy

• Proficient in end-to-end machine learning projects using Dataiku platform.
• Expertise in visual machine learning workflows and automated model selection.
• Skilled in model deployment and monitoring within Dataiku ecosystem.

Dataiku Developer
Dataiku Academy

• Expert in developing custom plugins and extensions for Dataiku platform.
• Skilled in Python development within Dataiku recipe and custom component frameworks.
• Proficient in integrating external APIs and data sources with Dataiku.

Atlassian Agile Project Management Professional
Atlassian Academy

• Expert in agile project management using Jira and Confluence platforms.
• Proficient in sprint planning, backlog management, and team collaboration workflows.
• Skilled in implementing agile methodologies and scaling agile practices across teams.

</Skills>

Tech Stack

  • Python
  • PyTorch
  • R
  • Scala
  • Azure-SQL
  • MySQL
  • Redis
  • PostgresSQL
  • GITHUB
  • HuggingFace
  • GIT
  • Anaconda
  • Apache-Spark
  • Apache-Airflow
  • Apache-Hadoop
  • Apache-Cassandra
  • Apache-Kafka
  • AWS
  • Azure
  • GCP
  • Power-BI
  • Tableau
  • NumPy
  • Pandas
  • Scikit-learn
  • Matplotlib
  • Plotly
  • Streamlit
  • Flask
  • Docker
  • Kubernetes
  • TensorFlow
  • HTML5
  • CSS3
  • JavaScript
  • React
  • Node.js
  • MongoDB
  • GraphQL
  • Confluence
  • Jira
  • Excel
  • FastAPI
  • OpenCV

</Projects>

Project 1

Customer Chrun

Project 2

Heart stroke prediction

Project 3

Sentiment analyzer

Project 4

Sleep disorder prediction

Project 5

Fraud detection using R

Project 6

Big Data analysis using Databricks

Project 7

Medical cost prediction