Python Projects Part 1

For this project you will be working with the file insurance_claim_updated_short.csv, available from the homework description for the first Python homework.

Make sure you can open the file as a dataframe, as in the cell below, and check out the columns.
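If you want a starting point for that first step, here is a minimal sketch. It assumes pandas is installed and that the CSV sits in your working directory; the helper names load_claims and summarize_columns are made up for illustration and are not part of the assignment.

```python
import pandas as pd


def load_claims(path="insurance_claim_updated_short.csv"):
    """Read the claims CSV into a DataFrame.

    The default path assumes the file is in the current working directory;
    pass a different path if you saved it elsewhere.
    """
    return pd.read_csv(path)


def summarize_columns(df):
    """Return one row per column with its dtype and count of missing values."""
    return pd.DataFrame(
        {
            "dtype": df.dtypes.astype(str),
            "missing": df.isna().sum(),
        }
    )


# Typical use in a notebook cell:
# df = load_claims()
# print(df.columns.tolist())    # check out the column names
# print(summarize_columns(df))  # see which columns are numeric or have gaps
# df.head()
```

Looking at dtypes and missing values early helps you decide which columns can support a histogram or a correlation and which need cleaning first.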

Then, in your teams, decide on some interesting and practical questions that might be answered by analyzing this data. With your group, answer the question(s) with:

1. Useful visualizations of the data.
2. Relevant statistics, e.g. correlation coefficients.

Be sure to state the answer to your question, and what advice you might give to the insurance company if you were analyzing this data on their behalf. Some (obvious?) ideas to get you thinking:

1. What is the typical age of a driver filing a claim with this company?
2. How long has a driver typically been a customer with the company when they file a claim?
3. Is the age of the driver correlated with the total claim amount?
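The ideas above could be approached roughly as sketched here, using pandas, scipy, and matplotlib. This is one possible outline, not the required method. The function names are hypothetical, and so are the column names in the usage comments ("age", "total_claim_amount"); check df.columns for the real names in the file.

```python
import matplotlib.pyplot as plt
import pandas as pd
from scipy import stats


def typical_value(df, column):
    """Return (median, mean) of one numeric column, ignoring missing values."""
    values = df[column].dropna()
    return values.median(), values.mean()


def correlation(df, x_col, y_col):
    """Return Pearson's r and its p-value for two numeric columns.

    Only rows where both columns are present are used.
    """
    pair = df[[x_col, y_col]].dropna()
    r, p_value = stats.pearsonr(pair[x_col], pair[y_col])
    return r, p_value


def plot_relationship(df, x_col, y_col):
    """Draw a histogram of x_col beside a scatter plot of y_col against x_col."""
    fig, (ax_hist, ax_scatter) = plt.subplots(1, 2, figsize=(10, 4))
    ax_hist.hist(df[x_col].dropna(), bins=20)
    ax_hist.set_xlabel(x_col)
    ax_hist.set_ylabel("count")
    ax_scatter.scatter(df[x_col], df[y_col], alpha=0.5)
    ax_scatter.set_xlabel(x_col)
    ax_scatter.set_ylabel(y_col)
    fig.tight_layout()
    return fig


# Typical use, with HYPOTHETICAL column names (replace with the real ones):
# median_age, mean_age = typical_value(df, "age")
# r, p_value = correlation(df, "age", "total_claim_amount")
# plot_relationship(df, "age", "total_claim_amount")
# plt.show()
```

Reporting the median alongside the mean is useful when a column is skewed, and the scatter plot lets you check whether a single correlation coefficient is a fair summary of the relationship.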

It is also permissible and expected that you’ll come up with some questions that you’re unsure how to answer. Make a note of those questions!

Time permitting, we’ll close class on the final project day with a brief discussion/presentation by each group on their findings.

Don’t be afraid to look things up! scipy and matplotlib documentation is a Google search away. Try to look up the answer before asking.