sepe data set is a sample of the pilot survey in 1993 of the Swiss Federal Statistical Office on environment protection expenditures of Swiss private economy in the previous accounting year. The units are enterprises, the monetary variables are in thousand Swiss Francs (CHF). From the original sample a random subsample was chosen of which certain enterprises were excluded for confidentiality reasons. In addition, noise has been added to certain variables, and certain categories have been collapsed. The data set has missing values. The data set has first been prepared for the EU FP5 project EUREDIT and later been data protected for educational purposes.
A data frame with 675 observations on 23 variables.
categoric variable: 1 = 'non-zero total expenditure', 2 = 'zero total expenditure', 3 = 'no answer to the question'
total investment for water protection
total investment for waste management
total investment for air protection
total investment for noise protection
total investement for other environmental protection areas
overall total investment in all environmental protection areas
total current expenditure in environmental protectiona area water protection
total current expenditure in environmental protectiona area waste management
total current expenditure in environmental protectiona area air protection
total current expenditure in environmental protectiona area noise protection
total current expenditure in other environmental protectiona
overall total current expenditure in all environmental protectiona
total subsidies for environmental protection received
total receipts from environmental protection
number of employees
size class (according to number of employees)
stratum number of sample design
code of economic activity (aggregated)
number of enterprises in the population-stratum
number of employees in population activity group
sampling weight (for extrapolation to the population)
The sample design is stratified random sampling with different sampling rates. Use package survey or sampling to obtain correct point and variance estimates. In addition a ratio estimator may be built using the variable
popemple which gives the total employment per
There are two balance rules: the subtotals of the investment variables should sum to
totinvto and the expenditure subtotals should sum to
The missing values stem from the survey itself. In the actual survey the missing values were declared as ”guessed” rather than copied from records.
The sampling weight
weight is adjusted for non-response in the stratum, i.e.
Swiss Federal Statistical Office (1996), Umweltausgaben und -investitionen in der Schweiz 1992/1993, Ergebnisse einer Pilotstudie.
Charlton, J. (ed.), Towards Effective Statistical Editing and Imputation Strategies - Findings of the Euredit project, unpublished manuscript available from Eurostat and http://www.cs.york.ac.uk/euredit/