1 00:00:00,005 --> 00:00:01,003 - [Instructor] If you've ever done 2 00:00:01,003 --> 00:00:03,002 any data science work with Python, 3 00:00:03,002 --> 00:00:06,009 then you're probably familiar with the pandas library. 4 00:00:06,009 --> 00:00:09,009 You can find the documentation for pandas at this link, 5 00:00:09,009 --> 00:00:12,005 and I would suggest keeping it open in a browser tab 6 00:00:12,005 --> 00:00:14,003 as you work through this chapter 7 00:00:14,003 --> 00:00:15,009 in case you want to refer to it 8 00:00:15,009 --> 00:00:18,004 or try out some of your own ideas. 9 00:00:18,004 --> 00:00:21,006 If I click on the API reference, 10 00:00:21,006 --> 00:00:25,002 you can see various parts of the pandas library. 11 00:00:25,002 --> 00:00:27,000 pandas is specifically designed 12 00:00:27,000 --> 00:00:30,004 for data manipulation and analysis, 13 00:00:30,004 --> 00:00:33,009 and one of its core features is a high performance, 14 00:00:33,009 --> 00:00:37,006 easy-to-use structure called a data frame. 15 00:00:37,006 --> 00:00:39,004 And I'll click on that link here. 16 00:00:39,004 --> 00:00:43,001 A data frame is a two-dimensional data table 17 00:00:43,001 --> 00:00:44,004 similar to a spreadsheet, 18 00:00:44,004 --> 00:00:47,002 but with a lot of extra built-in features, 19 00:00:47,002 --> 00:00:50,006 some of which we will see later in this chapter. 20 00:00:50,006 --> 00:00:52,002 Now, pandas, as you can see, 21 00:00:52,002 --> 00:00:54,003 is a very comprehensive library. 22 00:00:54,003 --> 00:00:56,001 It has a lot of features. 23 00:00:56,001 --> 00:00:58,004 The API surface is pretty big. 24 00:00:58,004 --> 00:01:01,004 And it is well beyond the scope of this course 25 00:01:01,004 --> 00:01:05,004 to do a deep dive into all of its features. 26 00:01:05,004 --> 00:01:07,001 I'm going to focus in this chapter 27 00:01:07,001 --> 00:01:12,003 on reading and writing data, both in CSV and Excel formats, 28 00:01:12,003 --> 00:01:14,005 along with some basic data frame operations 29 00:01:14,005 --> 00:01:17,002 like exploring and manipulating data. 30 00:01:17,002 --> 00:01:19,002 If you want to learn more about pandas, 31 00:01:19,002 --> 00:01:22,000 there are several other courses available here 32 00:01:22,000 --> 00:01:25,000 in the online library that you can refer to. 33 00:01:25,000 --> 00:01:26,003 So, let's make sure that pandas 34 00:01:26,003 --> 00:01:29,001 is properly installed on your system. 35 00:01:29,001 --> 00:01:30,000 And what I'm going to do 36 00:01:30,000 --> 00:01:34,002 is I'm going to do this in Visual Studio Code. 37 00:01:34,002 --> 00:01:39,006 So here in VS Code, let's open up the built-in terminal, 38 00:01:39,006 --> 00:01:43,007 which is right here, and let's make that full screen. 39 00:01:43,007 --> 00:01:50,007 So, I'm going to type the command pip show pandas, 40 00:01:50,007 --> 00:01:52,007 and you can see that when I do that, 41 00:01:52,007 --> 00:01:54,001 I'm getting some output here 42 00:01:54,001 --> 00:01:56,002 in the terminal about the library. 43 00:01:56,002 --> 00:01:59,006 Now, if you didn't get a result that looks like this, 44 00:01:59,006 --> 00:02:02,001 you might need to install it. 45 00:02:02,001 --> 00:02:08,000 So, if you have to install it, type pip install pandas, 46 00:02:08,000 --> 00:02:10,002 and you can see that I've already got it installed. 47 00:02:10,002 --> 00:02:13,003 So, all the requirements are already satisfied, 48 00:02:13,003 --> 00:02:14,006 but if you don't have it, 49 00:02:14,006 --> 00:02:17,000 then this will install it onto your computer. 50 00:02:17,000 --> 00:02:20,000 So that's all you need to do to get set up for this chapter.