Module 1, Week 1, gretl Problem Set 1

 

Use the “Mass Transit Use and Employment” dataset and gretl to answer the following questions.

 

1. How many variables are in this dataset?

 

2. How many observations are in this dataset?

 

3. What are the elements (entities) in this dataset? In other words, the data are observations of what?

 

4. Calculate simple descriptive statistics for the variables income and age. Does either variable appear to be skewed? How do you know?

 

5. Calculate the correlation coefficient between income and age. Also, produce a scatterplot for these two variables. Is the correlation positive or negative? Is the correlation strong or weak?

 

6. Tabulate the variable employment (i.e., get its frequency distribution). How many people in the dataset are unemployed? What percentage of people are unemployed? Produce a graph of the frequency distribution for the variable employment.

 

7. Create a new variable named rural that equals 1 if a person currently lives in a rural area and equals 0 if a person does not currently live in a rural area. Give this variable a label of “lives in rural area”.

 

8. Produce a cross-tabulation between the variables rural and citizen. In the dataset, what percentage of people living in rural areas are citizens?

 

9. Calculate descriptive statistics for the variables income and age separately for people living in a rural area and people not living in a rural area.

 

10. Calculate the correlation coefficient between income and age, but only for people not living in a rural area.

 

11. Create a new variable named senior that equals 1 if a person is 65 years old or older and equals 0 otherwise. What percentage of people in the dataset are seniors?
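
For readers who want to check the logic behind questions 4, 5, 7, 8 and 11 by hand, the following is a minimal sketch in Python rather than gretl. It is not a solution to the problem set: the five records, the "area" field and its coding, and the helper names pearson and percent are all made up for illustration and do not come from the "Mass Transit Use and Employment" dataset. It only shows the arithmetic involved: comparing the mean with the median as a rough skewness check, computing a correlation coefficient, building 0/1 indicator variables, and taking a percentage within a subgroup.

```python
from math import sqrt
from statistics import mean, median

# Hypothetical toy records, NOT the Mass Transit Use and Employment data.
# The "area" field and its values are assumptions made for this sketch.
people = [
    {"income": 20, "age": 25, "area": "rural", "citizen": 1},
    {"income": 30, "age": 35, "area": "urban", "citizen": 1},
    {"income": 40, "age": 45, "area": "rural", "citizen": 0},
    {"income": 50, "age": 70, "area": "urban", "citizen": 1},
    {"income": 160, "age": 65, "area": "suburban", "citizen": 0},
]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def percent(rows, key):
    """Percentage of rows in which the 0/1 variable `key` equals 1."""
    return 100 * sum(r[key] for r in rows) / len(rows)


incomes = [p["income"] for p in people]
ages = [p["age"] for p in people]

# Rough skewness check: a mean well above the median hints at right skew.
income_mean, income_median = mean(incomes), median(incomes)

# Correlation between income and age.
r = pearson(incomes, ages)

# Indicator (dummy) variables: 1 if the condition holds, 0 otherwise.
for p in people:
    p["rural"] = 1 if p["area"] == "rural" else 0
    p["senior"] = 1 if p["age"] >= 65 else 0

# Percentage of seniors in the whole sample.
pct_senior = percent(people, "senior")

# Percentage of citizens among rural residents only (one row of a cross-tab).
rural_rows = [p for p in people if p["rural"] == 1]
pct_citizen_rural = percent(rural_rows, "citizen")

print("income mean:", income_mean, "median:", income_median)
print("corr(income, age):", round(r, 3))
print("percent senior:", pct_senior)
print("percent citizens among rural:", pct_citizen_rural)
```

The same steps carry over to gretl, where the summary statistics, correlation, and cross-tabulation are produced by the program and only the interpretation is left to you.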

 

 
