Skip to content

vinh2155/Data_manipulation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Data_manipulation

Why ?

I recently saw an interesting video that "Machine Learning practitioners spend nearly 80% of their time working with data!"! Since I am still in the early stage of my learning journey in Data Science, I think I should master the basics in order to save me time for my real projects. That's why I am going to manipulate a dataset.

Dataset vs Dataframe

Dataset: Any collection of data, which can come in various formats such as CSV files, databases, JSON, or Excel sheets. It's a broad term that refers to data in any structured or unstructured form.

DataFrame: A specific type of data structure, usually in a tabular format, where data is organized into labeled rows and columns. It's used in programming environments like Python's pandas or R for data manipulation and analysis.

What ?

I'm going to use a dataset that I find interesting to make this practice a fun moment :). I will be using a Nobel Prices dataset in a CSV file that I found on Kaggle !

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors