docs/DSQ.md

What Is DSQ?

DSQ stands for Dietary Screener Questionnaire, a 26-item survey instrument developed by the National Cancer Institute (NCI) at NIH.

Usage

The basic usage is

df_out <- DSQ_Score(df_in)

where df_in is the input data frame and df_out is the output data frame.

Input Variables

The input data frame requires the following variables:

| Fruved Data Dictionary | SAS Variable | Choice | Score | |
|------------------------|--------------|--------|-------|---|
| Age | | | | |
| Gender | | | | |
| | | | | |

Output Variables

| Fruved Data Dictionary | SAS Variable | Choice | Score | |
|------------------------|--------------|--------|-------|---|
| Age | | | | |
| Gender | | | | |
| | | | | |

To-Do

How to Score Your Data in R?

Step 1. Prepare the Data

We provide the following utility function in R to rename your variables. Please note that you still need to ensure that the values of each variable are encoded correctly.

data <- DSQ_Variables(data, UNIQUEID="ID",
                      DSQ_xx1="Age",
                      DSQ_xx2="Gender",
                      DSQ_010="Dsqcereal",
                      DSQ_020="Dsqcertyp1",
                      DSQ_xx3="Dsqcertyp2",
                      DSQ_030="Dsqmilk1",
                      DSQ_040="Dsqsoda",
                      DSQ_050="Dsqjuice",
                      DSQ_060="Dsqcoffee",
                      DSQ_070="Dsqdrink",
                      DSQ_080="Dsqfruit",
                      DSQ_090="Dsqsalad",
                      DSQ_100="Dsqfried",
                      DSQ_110="Dsqpotato",
                      DSQ_120="Dsqbean",
                      DSQ_130="Dsqveg",
                      DSQ_140="Dsqpizza",
                      DSQ_150="Dsqsalsa",
                      DSQ_160="Dsqsauce",
                      DSQ_190="Dsqcheese",
                      DSQ_180="Dsqproc",
                      DSQ_200="Dsqbread",
                      DSQ_210="Dsqgrain",
                      DSQ_220="Dsqcandy",
                      DSQ_230="Dsqrolls",
                      DSQ_240="Dsqcake",
                      DSQ_250="Dsqice",
                      DSQ_260="Dsqcorn")

Once you save the output data frame data to a standard data file (such as CSV), you can import it into SAS and run the official SAS scoring script.
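A minimal sketch of the export step, using only base R. The toy data frame, the file name dsq_for_sas.csv, and the choice to write missing values as empty fields are all assumptions for illustration; adjust them to whatever your SAS import step expects.

```r
# Toy stand-in for the renamed data frame (assumed, not real survey data)
data <- data.frame(
  SEQN    = c(1, 2),
  DSQ_010 = c("A", "C"),
  DSQ_030 = c("K", NA),
  stringsAsFactors = FALSE
)

# Hypothetical output path; replace with a real location
out_file <- file.path(tempdir(), "dsq_for_sas.csv")

# row.names = FALSE avoids an extra unnamed column in the file;
# na = "" writes missing values as empty fields
write.csv(data, out_file, row.names = FALSE, na = "")

# Read the file back to confirm the layout survived the round trip
check <- read.csv(out_file, stringsAsFactors = FALSE, na.strings = "")
```

Reading the file back before handing it to SAS is a cheap way to catch lost columns or mangled missing values.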

Step 2. Run the Scoring Code

The scoring algorithms are well documented on the official NCI website. There is an earlier method and a current method; the current method is recommended. Both methods have been implemented in SAS. This package implements the current method in R.

Once your data (stored in a data frame named data) is cleaned and formatted according to the official codebook, you can run this:

data_scored <- DSQ_Scores(data)

By default, the output data frame data_scored will only include the computed variables.

Example

Input and Output

DSQ Input Data Formatting Requirements

Our scoring code is based on the Self-Administered Questionnaire: Paper version, and we follow its codebook to the extent possible. There are a few discrepancies between the codebook and the official scoring code (see details below). In those cases, we follow the requirements of the official scoring code. This documentation summarizes the variable naming and value encoding requirements we implemented.

First, each record should be assigned a unique identifier SEQN.
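As a rough illustration, the identifier requirement can be checked in base R before scoring. The helper find_bad_ids below is hypothetical and not part of this package; it only assumes the identifier column is named SEQN, as stated above.

```r
# Toy data (assumed); SEQN is the identifier column named in this documentation
data <- data.frame(
  SEQN    = c(101, 102, 102, NA),
  DSQ_010 = c("A", "B", "C", "D"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: return rows whose identifier is missing or duplicated
find_bad_ids <- function(df, id = "SEQN") {
  ids <- df[[id]]
  # Flag every copy of a repeated value, not only the later ones
  dup <- duplicated(ids) | duplicated(ids, fromLast = TRUE)
  df[is.na(ids) | dup, , drop = FALSE]
}

bad <- find_bad_ids(data)
```

An empty result means every record has exactly one non-missing identifier.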

To score DSQ, two demographic variables are required: age and gender (DSQ_xx1 and DSQ_xx2 in the renaming call in Step 1).

The following cereal variables are collected:

* DSQ_010 : During the past month, how often did you eat hot or cold cereals? Choices are:
    - 'A' = Never
    - 'B' = 1 time last month
    - 'C' = 2-3 times last month
    - 'D' = 1 time per week
    - 'E' = 2 times per week
    - 'F' = 3-4 times per week
    - 'G' = 5-6 times per week
    - 'H' = 1 time per day
    - 'I' = 2 or more times per day
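If your raw export stores the choices as integer codes rather than letters (an assumption about your data, not something this package prescribes), a small base R helper along these lines could convert them. The name code_to_letter and the 1 = 'A' ordering are hypothetical; verify the ordering against your own survey export.

```r
# Hypothetical helper: map integer codes 1..n_choices to 'A', 'B', ...
# Codes outside the valid range become NA instead of a wrong letter.
code_to_letter <- function(x, n_choices = 9) {
  x <- as.integer(x)
  out <- rep(NA_character_, length(x))
  ok <- !is.na(x) & x >= 1 & x <= n_choices
  out[ok] <- LETTERS[x[ok]]
  out
}

raw <- c(1, 9, 3, NA, 10)

# Nine choices ('A' to 'I'), as for DSQ_010
letters_9 <- code_to_letter(raw)

# Eleven choices ('A' to 'K'), as for the drink items
letters_11 <- code_to_letter(raw, n_choices = 11)
```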

The following drink items ask about frequency during the past month. Choices are from 'A' to 'K', as listed under DSQ_030.

* DSQ_030 : have any milk (either to drink or on cereal)? Choices are:
    - 'A' = Never
    - 'B' = 1 time last month
    - 'C' = 2-3 times last month
    - 'D' = 1 time per week
    - 'E' = 2 times per week
    - 'F' = 3-4 times per week
    - 'G' = 5-6 times per week
    - 'H' = 1 time per day
    - 'I' = 2-3 times per day
    - 'J' = 4-5 times per day
    - 'K' = 6 or more times per day
* DSQ_040 : drink regular soda or pop that contains sugar?
* DSQ_050 : drink 100% pure fruit juices such as orange, mango, apple, grape and pineapple juices?
* DSQ_060 : drink coffee or tea that had sugar or honey added to it?
* DSQ_070 : drink sweetened fruit drinks, sports or energy drinks, such as Kool-Aid, lemonade, Hi-C, cranberry drink, Gatorade, Red Bull or Vitamin Water?

The following food items ask about frequency during the past month. Choices are from 'A' to 'I', the same as in DSQ_010.

* DSQ_080 : eat fruit?
* DSQ_090 : eat green leafy or lettuce salad, with or without other vegetables?
* DSQ_100 : eat any kind of fried potatoes, including French fries, home fries, or hash brown potatoes?
* DSQ_110 : eat any other kind of potatoes, such as baked, boiled, mashed potatoes, sweet potatoes, or potato salad?
* DSQ_120 : eat refried beans, baked beans, beans in soup, pork and beans or any other type of cooked dried beans?
* DSQ_210 : eat brown rice or other cooked whole grains, such as bulgur, cracked wheat, or millet? Do not include white rice.
* DSQ_130 : not including what you just told me about (green salads, potatoes, cooked dried beans), how often did you eat other vegetables?
* DSQ_150 : have Mexican-type salsa made with tomato?
* DSQ_140 : eat pizza? Include frozen pizza, fast food pizza, and homemade pizza.
* DSQ_160 : have tomato sauces such as with spaghetti or noodles or mixed into foods such as lasagna?
* DSQ_190 : eat any kind of cheese? Include cheese as a snack, cheese on burgers, sandwiches, and cheese in foods such as lasagna, quesadillas, or casseroles.
* DSQ_180 : eat any processed meat, such as bacon, lunch meats, or hot dogs?
* DSQ_200 : eat whole grain bread including toast, rolls and in sandwiches?
* DSQ_220 : eat chocolate or any other types of candy?
* DSQ_230 : eat doughnuts, sweet rolls, Danish, muffins, pan dulce, or pop-tarts?
* DSQ_240 : eat cookies, cake, pie or brownies?
* DSQ_250 : eat ice cream or other frozen desserts?
* DSQ_260 : eat popcorn?
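The encoding rules above lend themselves to a quick pre-scoring check. The sketch below is not part of this package: count_invalid is a hypothetical helper, the toy data are made up, and only a few of the 'A' to 'I' items are listed for brevity. It counts values that fall outside the allowed letter set for each item and treats NA as missing rather than invalid.

```r
# Drink items use choices 'A' to 'K' per this documentation
items_AK <- c("DSQ_030", "DSQ_040", "DSQ_050", "DSQ_060", "DSQ_070")
# A subset of the items that use choices 'A' to 'I'
items_AI <- c("DSQ_010", "DSQ_080", "DSQ_090")

# Hypothetical helper: per item, count values outside the allowed set
count_invalid <- function(df, items, allowed) {
  present <- intersect(items, names(df))  # skip items absent from df
  vapply(present, function(v) {
    x <- df[[v]]
    sum(!is.na(x) & !(x %in% allowed))
  }, integer(1))
}

# Toy data (assumed): "J" is invalid for DSQ_010, "L" for DSQ_030
data <- data.frame(
  DSQ_010 = c("A", "J", "I"),
  DSQ_030 = c("K", "L", NA),
  stringsAsFactors = FALSE
)

invalid_AI <- count_invalid(data, items_AI, LETTERS[1:9])
invalid_AK <- count_invalid(data, items_AK, LETTERS[1:11])
```

A nonzero count for any item points at rows to recode or set to missing before scoring.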

The following food intake variables are collected but not used for scoring:

* DSQ_xx4 : what kind of milk did you usually drink?
* DSQ_xx4os : what kind of milk did you usually drink? Other Specify
* DSQ_170 : eat red meat, such as beef, pork, ham, or sausage?

DSQ Output

See NCI website.



wzhou7/Fruved documentation built on June 15, 2025, 1:16 a.m.