All Blogs


The Basics of k-Anonymity: Making Individuals Harder to Identify
k-Anonymity is a privacy-preserving technique designed to reduce the risk of identifying individuals in a dataset. The central idea is simple: each record should be indistinguishable from at least k − 1 other records based on a selected set of attributes.
Swetha Srihari
Jun 5 · 6 min read
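The definition in the teaser above can be checked mechanically: group the records by the selected attributes (often called quasi-identifiers) and confirm that every group has at least k members. The following is a minimal sketch under stated assumptions, not the post author's implementation. The function name `is_k_anonymous`, the field names, and the toy records are all hypothetical and chosen only for illustration.

```python
from collections import Counter


def is_k_anonymous(records, quasi_identifiers, k):
    """Return True if every combination of quasi-identifier values
    is shared by at least k records."""
    if not records:
        return True  # an empty dataset trivially satisfies the property
    # Count how many records share each combination of selected attributes.
    groups = Counter(
        tuple(record[attr] for attr in quasi_identifiers)
        for record in records
    )
    # The smallest group decides whether the whole dataset is k-anonymous.
    return min(groups.values()) >= k


# Hypothetical, already-generalized toy records (not taken from the blog post).
records = [
    {"age_range": "20-29", "zip_prefix": "021", "diagnosis": "flu"},
    {"age_range": "20-29", "zip_prefix": "021", "diagnosis": "asthma"},
    {"age_range": "30-39", "zip_prefix": "100", "diagnosis": "flu"},
    {"age_range": "30-39", "zip_prefix": "100", "diagnosis": "diabetes"},
]

print(is_k_anonymous(records, ["age_range", "zip_prefix"], k=2))  # True
print(is_k_anonymous(records, ["age_range", "zip_prefix"], k=3))  # False
```

Each (age_range, zip_prefix) combination appears twice here, so every record is indistinguishable from one other record on those attributes: the data is 2-anonymous but not 3-anonymous.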


The Privacy Challenge in Machine Learning
I had always believed that we can maintain privacy by not revealing too much information about ourselves in online forums and on social media. But my perspective changed when I took a course during my graduate studies called “Privacy Aware Computing”. I attended the first lecture without even enrolling in the course because I wanted to know if it would interest me. I was not expecting much, but I was drawn in by the subject. That single lecture was enough to c
Swetha Srihari
Jun 4 · 7 min read


From Raw Healthcare Data to Diabetes Insights: End-to-End Data Analytics Project Using Python & Tableau
In healthcare, data is often available in large quantities but in messy, inconsistent formats. When I explored this dataset, I noticed missing values, inconsistent formats, outliers, and unstructured clinical measurements. This blog focuses on transforming raw healthcare data into meaningful insights using Python for data cleaning and feature engineering, followed by building interactive dashboards in Tableau. The main goal of this blog, before building dashboards or
saranyashanmugam200
Jun 3 · 6 min read


Building and Deploying a Streamlit Dashboard: A Complete Step-by-Step Guide
When I started working on a diabetes data dashboard, I wanted something simple: a clean web app where I could explore data, detect outliers, visualize patterns, and run a machine learning model, all in Python, a language that is familiar and easy to use. Streamlit is a remarkable Python library that lets you turn a normal Python script into a fully interactive web application. In this blog, I’ll walk through everything I did, from installing packages, building the app, adding caching and navig
saranyashanmugam200
Jun 1 · 4 min read
How to build a Machine Learning Model?
Machine Learning has become one of the most talked-about topics in the industry today. From Netflix recommendations to Google search results, machine learning is everywhere. When I first started learning machine learning, I assumed it was all about complex math and confusing algorithms. But once I understood the workflow, everything started to make sense. Machine learning is not magic. It is a step-by-step process, very similar to how humans learn from experiences. Let's bre
Anandhi Kamalakannan
Jan 12 · 4 min read


Machine Learning 101
I always thought machine learning was a complex subject, something very difficult to understand for a basic programmer like me. Because of that belief, I assumed machine learning was not for me at all. Today, I regret thinking that way. Once I started learning it, I realized that the concepts were not as scary as I had imagined. In fact, I was able to understand them and even relate them to real-world scenarios. I can now connect the dots. For the first time, I fee
Anandhi Kamalakannan
Jan 11 · 4 min read


Data Cleaning Explained: How Clean Data Drives Better Visualizations and Decisions
Hi team! Check out my latest blog on why data cleaning is critical for accurate visualizations and better business decisions. I also show a real-world COVID-19 dataset case study using Python.
Gayathri Venkatachalam
Jan 9 · 8 min read


Machine Learning: Introduction, Types, Algorithms, and Real-World Applications
What is Machine Learning and Why It Matters? Machine Learning (ML) is a powerful subset of Artificial Intelligence (AI) that lies at the...
shravanibotta
May 22, 2025 · 8 min read


The Power of Data Cleaning: Mess to Masterpiece
Suze Orman once said, “Cleanliness is a state of purity, clarity, and precision.” This not only applies to our day-to-day cleaning, but...
Charishma Chadalavada
Dec 19, 2024 · 6 min read


Step-by-Step Research in ML Using the Decision Tree Algorithm
DRS Bank is facing challenging times. Their NPAs (Non-Performing Assets) have been on the rise recently and a large part of these are due...
Yamini
Dec 18, 2024 · 4 min read


Breaking Down the Tree - Part 3
DECISION TREE - PART 3- Yamini Kumaresan
Yamini
Dec 17, 2024 · 5 min read


Breaking Down the Tree - Part 2
Decision Tree - Part 2 - Yamini Kumaresan
Yamini
Dec 17, 2024 · 6 min read


Breaking Down the Tree: Exploring Decision Tree Machine Learning Algorithms
Decision Tree Part 1 - Yamini Kumaresan
Yamini
Dec 16, 2024 · 4 min read


Data Granularity at its finest!
Data is everywhere around us, at every possible level of detail. Data has become the core of every aspect of our day-to-day...
Sruthi Sudhagaran
Aug 29, 2024 · 6 min read


King County Housing Data Analysis
King County is one of the fastest-growing counties in the US. The early 20th century was a time of population growth and...
Sowmyalakshmi Suresh kumar
Jul 15, 2023 · 2 min read


Interpolation using pandas
Introduction to interpolation: Interpolation is one of the methods of filling null values. Before learning about interpolation, let us...
Jyothi Chanda
Mar 31, 2022 · 3 min read


EDA using Pandas Profiling
EDA (Exploratory Data Analysis): Exploratory Data Analysis (EDA) is an...
Ruchi Sharma
Mar 31, 2022 · 3 min read


Difference between loc and iloc in pandas
In data analysis, how you select data is very important, or in other terms, slicing and dicing of...
Ruchi Sharma
Mar 28, 2022 · 3 min read


Parsing JSON dataset using Pandas
In the process of data gathering, a data analyst has to handle various forms of data from different...
Ruchi Sharma
Mar 26, 2022 · 2 min read


A thumbnail of Feature Engineering
You can't build a great building on a weak foundation. You must have a solid foundation if you're going to have a strong super structure....
Mahitha Kumar
Sep 6, 2021 · 3 min read