Hi There,
I'm Sushith Karuvelil Suthan

About Me

I'm Sushith

Data Engineer

I'm a seasoned Data Engineer with 3+ years of experience at Tata Consultancy Services, where I worked closely with the US client Nielsen. I program in Python, Scala, Java, C, C++, Oracle PL/SQL, SQL, and Unix shell scripting, and I specialize in Python, Scala, Airflow, Hive, Impala, Oracle PL/SQL, SQL, and AWS services such as EMR, EC2, S3, Athena, and Lambda. I pair a knack for problem-solving with strong interpersonal and communication skills, which helps me engage with stakeholders at all levels. Let's embark on a data-driven journey together.

Email: sushithks@gmail.com

Skills & Abilities

Python
Scala
Java
SQL
PostgreSQL
Airflow
Spark
AWS
GCP
Azure
Hadoop
Kafka
Terraform
Hive
CI/CD
Power BI
Tableau
Elasticsearch
Lucene
pandas
NumPy
Keras
TensorFlow

My Education

Education is not the learning of facts, but the training of the mind to think.

Artificial Intelligence and Data Science

Loyalist College in Toronto

2022-2023

Bachelor of Computer Science and Engineering

Sree Narayana Guru Institute of Science & Technology | SNGIST

2014-2018

Projects

UNHID

A unique health identification system that lets hospitals and doctors manage patient data securely and effectively.

Park-View

Introducing our new Android application designed to alleviate the stress of finding parking in bustling cities. With our app, users can effortlessly locate available parking spots through an intuitive map interface and reserve a slot with just a few taps. But that's not all – we've revolutionized the parking landscape by enabling users to register their own land or spare parking spaces as available spots, allowing them to generate revenue from otherwise unused areas. Whether you're a driver in need of a convenient parking solution or a property owner looking to monetize your space, our app offers a seamless platform to meet your needs. Say goodbye to circling the block endlessly in search of a spot – with our app, finding parking has never been easier.

Endangered species detection

A machine learning model that identifies endangered species through image processing and provides the necessary details about them.

Weather report generation

An ETL project that uses Python, Airflow, and GCP.

GCS to BigQuery

Automating BigQuery Table Creation from Google Cloud Storage Using Dataflow and Cloud Functions

Experience

Freelance

Data Developer | Full time

Dec 2023 - Present

● Offer specialized expertise in designing, developing, and optimizing data solutions tailored to meet diverse client needs.
● Create robust data pipelines, manage complex ETL processes, and implement advanced analytics solutions.
● Gather, refine, and structure data from multiple sources.
● Conduct statistical analysis and create visualizations to uncover valuable insights.
● Support the design and execution of data analysis workflows.
● Develop reports and dashboards to present findings and key performance indicators effectively.
● Assist the front-end team by providing data-driven insights and supporting their integration with data visualization tools.

Docks

Data Analyst | Co-op

May 2023 - Sep 2023

Tata Consultancy Services | TCS

Data Engineer | Full time

Jan 2019 - Feb 2021

● Part of the core team on a retail project for digital advertising ratings (Nielsen, United States).
● Automated tasks using Apache Airflow.
● Managed workflows with almost 22 hours of run time per daily run.
● Implemented process improvements that reduced execution time by 10%.
● Extracted and analyzed metadata from varied sources, programmatically converted data science algorithms to keep the system running smoothly, eliminated false data, and implemented data solutions.
● Built intricate, fully automated data processing and ETL pipelines, including building JARs, creating unit test cases, implementing quality assurance policies, version control with Git (CI/CD), and deployment.
● Queried data with AWS Athena, ran jobs on EMR clusters, and stored data in S3.
● Applied test-driven development to deliver defect-free modules and ensure data quality, resulting in a 25% reduction in post-release defects.
● Improved system efficiency by 18% through effective integration, resulting in faster execution and response times.
● Coached and mentored new graduate associates to handle critical daily tasks and provide customer service support, improving their efficiency by 30% within the first three months and demonstrating leadership skills.