Hi There,
I'm Sushith Karuvelil Suthan
i am into
About MeI'm a seasoned Data Engineer with 4+ years of experience from Tata Consultancy Services, where I collaborated closely with the esteemed US client, Nielsen. My programming arsenal includes Python, Scala, Java, C, C++, Oracle PL/SQL, SQL, and Unix shell scripting. I specialize in Python, Scala, Airflow, Hive, Impala, Oracle PL/SQL, SQL, and various AWS services like EMR, EC2, S3, Athena, and Lambda. With a knack for problem-solving, coupled with robust interpersonal and communication skills, I excel in engaging with stakeholders at all levels. Let's embark on a data-driven journey together.
email : sushithks@gmail.com
Education is not the learning of facts, but the training of the mind to think.
Loyalist College In Toronto
Sree Narayana Guru Institute of Science & Technology | SNGIST
Validation of skills, fueled by ambition.
Google Cloud
unique health identification system hospitals and doctors to manage patents data secularly and effectively
Introducing our new Android application designed to alleviate the stress of finding parking in bustling cities. With our app, users can effortlessly locate available parking spots through an intuitive map interface and reserve a slot with just a few taps. But that's not all – we've revolutionized the parking landscape by enabling users to register their own land or spare parking spaces as available spots, allowing them to generate revenue from otherwise unused areas. Whether you're a driver in need of a convenient parking solution or a property owner looking to monetize your space, our app offers a seamless platform to meet your needs. Say goodbye to circling the block endlessly in search of a spot – with our app, finding parking has never been easier.
A machine learning model which identifies endangered species by image processing and provides the necessary details
An ETL project which utilizes python,Airflow,GCP
Automating BigQuery Table Creation from Google Cloud Storage Using Dataflow and Cloud Functions
Nov 2024 - Present
●
Training AI systems to efficiently browse, access, and retrieve relevant information from diverse online sources.
●
Helping AI systems identify and solve challenges more effectively.
●
Working on optimizing AI to retrieve the right information at the right time.
●
Training AI systems to efficiently browse and access information, enhancing their ability to navigate complex data landscapes and provide actionable insights.
●
Improving how AI interacts with users by delivering precise and timely insights.
●
Leveraging AI to provide clear, actionable recommendations.
●
Reviewing and assessing the work of peers and AI models to ensure alignment with best practices, accuracy, and strategic goals.
Dec 2023 - Present
●
Offer specialized expertise in designing, developing, and optimizing data solutions tailored to meet diverse client needs.
●
Creating robust data pipelines, managing complex ETL processes, and implementing advanced analytics solutions.
●
Gather, refine, and structure data from multiple sources.
●
Conduct statistical analysis and create visualizations to uncover valuable insights.
●
Support the design and execution of data analysis workflows.
●
Develop reports and dashboards to present findings and key performance indicators effectively.
●
Assist the front-end team by providing data-driven insights and supporting their integration with data visualization tools.
May 2023 - Sep 2023
●
Assist in data collection, cleaning, and organizing data from various sources.
●
Using MySQL and SQL queries to validate and verify data.
●
Perform statistical analysis and data visualization to extract meaningful insights.
●
Assist in the development and implementation of data analysis processes.
●
Validation of data using techniques like data mapping, data dictionary.
●
Developing Data Solutions on request.
●
Create reports and dashboards to communicate findings and key performance metrics.
Jan 2019 - Feb 2021
●
Part of the core team in a retail project, digital advertising rating (Nielsen, United States)
●
Automation of tasks using Apache Airflow.
●
Managing workflows which had almost 22 hours of run time for each daily run.
●
Implemented Process improvements which reduced execution time by 10%.
●
Extracting and analyzing metadata from varied sources and programmatically converting Data Science algorithms for smooth functioning of the system and eliminating false data and implementing data solutions.
●
Constructing intricate data processing and ETL data pipeline with a complete automated process, building jars, creating unit test cases, implementing quality assurance policies, version controlling using Git(CI/CD) and deployment.
●
Data query using AWS Athena, EMR cluster for running jobs, S3 for data storage.
●
Test-driven development to deliver defect free modules ensuring data quality resulting in 25% reduction in post-release defects, ensuring higher product quality.
●
Improved system efficiency by 18% through effective integration, resulting in faster execution and response times.
●
Coached and mentored new grad associates to handle critical daily tasks and provide customer service support, thus improving their efficiency by 30% within the first three months of mentoring hence proving leadership skills.