mes6@njit.edu   

Disclaimer: This website will be used for academic research purposes.

Machine Learning

Data Wrangling

Data wrangling, sometimes referred to as Data Munging, is the process of manipulating raw data into a format that is more appropriate and useful for further analysis. It involves cleaning up data by removing or modifying incorrect data points, normalizing or standardizing formats, transforming data into different forms that are more suitable for analysis, and […]
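The cleaning, standardizing, and transforming steps named above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the records, the field names (`name`, `age`, `city`), and the rule that a negative age counts as an incorrect data point are all hypothetical and do not come from the original post.

```python
from statistics import mean

# Hypothetical raw records; the field names are assumptions for illustration.
raw_records = [
    {"name": " Alice ", "age": "34", "city": "newark"},
    {"name": "BOB", "age": "-5", "city": "Newark"},    # impossible age
    {"name": "carol", "age": "", "city": "NEWARK "},   # missing age
]


def wrangle(records):
    """Clean, standardize, and transform raw records for analysis."""
    cleaned = []
    for rec in records:
        # Transform: parse the age text into an integer when possible.
        age_text = rec["age"].strip()
        age = int(age_text) if age_text.lstrip("-").isdigit() else None
        # Clean: treat impossible values as missing (assumed rule).
        if age is not None and age < 0:
            age = None
        # Standardize: trim whitespace and use consistent capitalization.
        cleaned.append({
            "name": rec["name"].strip().title(),
            "age": age,
            "city": rec["city"].strip().title(),
        })
    return cleaned


tidy = wrangle(raw_records)
known_ages = [r["age"] for r in tidy if r["age"] is not None]
average_age = mean(known_ages)
```

After wrangling, every record uses the same spelling conventions, and invalid ages are marked as missing rather than silently skewing the average.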

Data Science

Data science is a field of study that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from structured and unstructured data. Data Science is an Interdisciplinary Field: It is an interdisciplinary field of practice that combines mathematics, statistics, computer science, artificial intelligence (AI) and machine learning (ML) to analyze large volumes […]

Data Pipeline

A data pipeline is a set of processes that allow for the efficient and reliable flow of information from one computing system to another. It enables organizations to move large volumes of data between different sources in a timely and cost-effective manner. Uses of Data Pipelines: Data pipelines also enable organizations to collect, cleanse, transform […]
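As a rough sketch of the collect, cleanse, and transform flow described above, the stages of a pipeline can be written as chained Python generators. Everything here is an assumption made for illustration: the in-memory CSV string stands in for a source system, a plain list stands in for the destination, and the function names `extract`, `transform`, and `load` are hypothetical.

```python
import csv
import io

# Hypothetical source data standing in for an upstream system.
SOURCE_CSV = "id,amount\n1,10.5\n2,not_a_number\n3,4.5\n"


def extract(text):
    """Collect rows from the source (here, an in-memory CSV string)."""
    yield from csv.DictReader(io.StringIO(text))


def transform(rows):
    """Cleanse and convert rows, skipping any that cannot be parsed."""
    for row in rows:
        try:
            yield {"id": int(row["id"]), "amount": float(row["amount"])}
        except ValueError:
            continue  # drop the malformed row


def load(rows, destination):
    """Write rows to the destination (here, a plain list)."""
    for row in rows:
        destination.append(row)
    return destination


warehouse = load(transform(extract(SOURCE_CSV)), [])
```

Because each stage is a generator, rows stream through one at a time instead of being held in memory all at once, which is one common way to keep such a flow efficient.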

Data Mining

Data Mining is the process of analyzing large datasets and uncovering patterns, relationships, and other useful information. Techniques Involved in Data Mining: It involves applying techniques such as classification, clustering, association rule learning, anomaly detection, and regression to data collected from numerous sources. Data mining can help unlock insights that wouldn’t be possible by manual […]

Data Literacy

Data Literacy is the ability to read, write, understand and analyze data. It is an essential skill in the digital age that everyone should possess. Data literacy involves being able to interpret data and use it to make informed decisions. Understanding of Data Literacy: It involves understanding how data is structured, collected, analyzed and visualized, […]

Data Lake

A Data Lake is a single, centralized platform that stores large volumes of structured and unstructured data from multiple sources. Uses of Data Lake: It can be used to store various types of data, including text files, images, videos, and audio files. Data Lakes provide an efficient way to aggregate and process all kinds of […]

Data Governance

Data Governance Planning is the process of creating a plan for data governance, that is, for managing and securing data and information within an organization. It includes defining policies, procedures and controls that ensure the accuracy, quality, consistency, security, and integrity of data throughout its lifecycle. Comprehensive Approach: Also, Data Governance is a […]

DataFrame

A DataFrame is a tabular data structure with labeled axes (rows and columns). It is an essential tool for data manipulation, analysis, and visualization. It allows you to easily store, organize, and process large amounts of data so that it can be used to answer important questions or solve complex problems. Structure of DataFrame: DataFrames […]
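To make the idea of labeled rows and columns concrete, here is a minimal sketch using the pandas library. The excerpt does not name a specific library, so the choice of pandas is an assumption, as are the product data, the column names, and the row labels `r1` to `r3`.

```python
import pandas as pd

# Hypothetical data: column labels are the dict keys, row labels the index.
df = pd.DataFrame(
    {
        "product": ["pen", "book", "lamp"],
        "price": [1.5, 12.0, 30.0],
        "units": [100, 20, 5],
    },
    index=["r1", "r2", "r3"],
)

# Manipulation: derive a new column from two existing ones.
df["revenue"] = df["price"] * df["units"]

# Analysis: find the row label with the highest revenue.
top = df["revenue"].idxmax()

# Labeled access: look up one cell by row label and column label.
book_price = df.loc["r2", "price"]
```

Both axes carry labels, so a value is addressed by name (`df.loc["r2", "price"]`) rather than by numeric position.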