Non-respons
What is non-response?
The consequences of non-response
Causes of non-response
Demonstration: the effect of non-response

What is non-response?

BA lot of things can go wrong in a survey. One of the most important problems is non-response. It is the phenomenon that the required information is not obtained from the persons selected in the sample.


The consequences of non-response

One effect of non-response is that is reduces the sample size. This does not lead to wrong conclusions. Due to the smaller sample size, the precision of estimators will be smaller. The margins of error will be larger.

A more serious effect of non-response is that it can be selective. This occurs if, due to non-response, specific groups are under- or over-represented in the survey. If these groups behave differently with respect to the survey variables, this causes estimators to be biased. To say it in other word: estimates are significantly too high or too low.

Example: surveys of Statistics Netherlands
Selective non-response is not uncommon. It occurs in a number of surveys of Statistics Netherlands. A follow-up study of the Dutch Victimization Survey showed that persons, who are afraid to be home alone at night, are less inclined to participate in the survey. In the Dutch Housing Demand Survey, it turned out that people who refused to participate, have lesser housing demands than people who responded. And for the Survey of Mobility of the Dutch Population it was obvious that the more mobile people were under-represented among the respondents.

Causes of non-response

Non-response can have different causes. It is a good idea to distinguish these various types of non-response. Research has shown that different types of non-response may have different effects on estimators.

No-contact Refusal Not-able

The first step in getting the participation of a sample person in a survey is to make contact. If this is not possible, you have non-response due to no-contact.

If it is possible to make contact with a person, you can establish whether he or she belongs to the target population of the survey. If not, you can discard this case. You can ignore this person, because it is case of over-coverage. If a person belongs to the target population, you have to persuade him to co-operate. If this is not successful, you have a case of non-response due to refusal.

Even if there is contact, and the person wants co-operate, there can still be circumstances preventing obtaining answers to the questions. Examples are illness or language probems. This is non-response due to not-able.

AIf selected persons belongs to the target population, can be contacted, are prepared to participate, and are able to participate, then you have response.

Demonstratie: het effect van non-respons

There will be general elections in the country of Samplonia. The National Elderly Party (NEP) seems to do well in the campaign. An opinion poll is carried out to estimate the percentage of voters this party will attract. To determine how precise the estimator is, sample selection is repeated a large number of times. The percentage of voters is computed for each sample. The distribution of all these estimates is shown in a histogram.

The average of all estimates is computed. The estimators is unbiased if this average is (approximately) equal to the true population percentage (25.4%).

p>To carry out a simulation, you first set the sample size. You do that by clicking on the green square adjacent to Sample size. There are three possible sample sizes: 200, 400 or 800.

You can choose to generate non-response in the survey. You do that be clicking on the green square below Non-response. The probability of non-response increases with age in this demonstration. For young people, this probability is equal to 80%, fior middle-aged people it is 50%, and for elderly, the probability of non-response is 20%.

You start the simulation by clciking on Start.

If there is no non-response, the estimates will be neatly concentrated around the true percentage of voters in the population (25.4%). If there is non-response, the estimates will be significantly too low.

Why are to estimates too low? The reason is the elderly are unde-represented in the samples, because non-response is highest among them. It are the elderly who vote for NEP. So, there will be too few NEP-voters in the samples.

Note that non-response causes the variation of the estimates to increase. This is alo a typical non-response effect. Non-response reduces the sample size, and therefore increases the variance of estimators, leading to larger margins of error.