**NEW MERCH AVAILABLE!** podcast; sports articles; consulting

Feed Companion
Feed Companion
  • Home
  • PROJECTS
    • HOOPFOOD
    • CHARTFOOD
  • MERCHANDISE
  • PODCAST
  • DONATE
  • More
    • Home
    • PROJECTS
      • HOOPFOOD
      • CHARTFOOD
    • MERCHANDISE
    • PODCAST
    • DONATE
  • Home
  • PROJECTS
    • HOOPFOOD
    • CHARTFOOD
  • MERCHANDISE
  • PODCAST
  • DONATE

WORKING WITH THE TITANIC DATA SET

The titanic data set is considered the "Hello, World" of data sets in data science. Here, I go through the data with the help of a guide on the VScode website that allows you a tutorial for Data Science and VS Code, the integrated development environment.

First I imported pandas and numpy into my data set (links to documentation provided).

Next I imported the CSV data with the help of pandas' read_csv(" ")

and called for the data in the next cell. 


A cleanly organized data frame is returned including data on passenger class, whether the passenger survived, name, sex, and much more. 


Legend to the data can be found at the tutorial site mentioned above.

Finally now you can use Seaborn and Matplotlib, which are plotting libraries for data visualization.

Lastly, we will look at one of the violin plots provided by Seaborn. 


This allows us to see two variables at play. 

Passenger age correlated to whether they survived. 


The data, separated for whether they were male or female, you can see that very few over the age of 60 years old survived. 



Show More

Copyright © 2023 Feed Companion - All Rights Reserved.