Module 1, Week 1, gretl Problem Set 1
Module 1, Week 1, gretl Problem Set
1
Use
the “Mass Transit Use and Employment” dataset and gretl to answer the following
questions.
1.
How many variables are in this dataset?
2.
How many observations are in this dataset?
3.
What are the elements (entities) in this dataset? In other words, the data are
observations of what?
4.
Calculate simple descriptive statistics for the variables income and age. Does
either variable appear to be skewed? How do you know?
5.
Calculate the correlation coefficient between income and age. Also, produce a
scatterplot for these two variables. Is the correlation positive or negative?
Is the correlation strong or weak?
6.
Tabulate (i.e. get a frequency distribution) for the variable employment. How
many people in the dataset are unemployed? What percentage of people are unemployed?
Produce a graph of the frequency distribution for the variable employment.
7.
Create a new variable named rural that equals 1 if a person currently lives in
a rural area and equals 0 if a person does not currently live in a rural area.
Give this variable a label of “lives in rural area”.
8.
Produce a cross-tabulation between the variables rural and citizen. In the
dataset, what percentage of people living in rural areas are citizens?
9.
Calculate descriptive statistics for the variables income and age separately
for people living in a rural area and people not living in a rural area.
10.
Calculate the correlation coefficient between income and age, but only for people
not living in a rural area.
11.
Create a new variable named senior if a person is 65 years old or older. What
percentage of people in the dataset are seniors?
Comments
Post a Comment