label_dye_screen: Label and clean dye screen data

View source: R/tidy_dye_screen.R

label_dye_screenR Documentation

Label and clean dye screen data

Description

A helper function for tidy_dye_screen(). Ensures that individual runs contain the correct columns.

Usage

label_dye_screen(
  .data,
  .layout,
  .type,
  .variable_col = "variable",
  .temp_col = "Temperature",
  .value_col = "value",
  .dye_col = "final_compound",
  .dye_conc_col = "final_concentration",
  .channel_col = "channel_f",
  .drop_layout_empties_from = "dye",
  .keep_additional_cols = "",
  ...
)

Arguments

.data

a long-form tibble containing a single plate of raw DSF data from a dye screen. Must include columns containing information on each of the following things:

  • well, or some other individual reaction-identifying information. This column is coerced to character-type in the output of this function.

  • Temperature, giving the Temperature of each RFU measurement. This column is coerced to numeric-type in the output of this function.

  • Wavelength of measurement A column giving information about the wavelengths at which measurements were made, e.g. "channel_f". This column must be a factor; it is recommended that the factor levels reflect the wavelengths measured by each channel. For example, "channel_f" may be a factor column, with levels of "FAM", "JOE", "TAMRA", "ROX", "Cy5", "Cy5.5". This column is added automatically by read_qtower, but may need to be generated manually for other types of inputs.

  • Raw RFU value, giving the raw values measured. This column is coerced to numeric-type in the output of this function. A by-variable normalized version of this column is also present in the returned tibble.

.layout

a long-form tibble containing plate layout information for the supplied raw data. Must include columns containing information on each of the following things: #'

  • well, or some other individual reaction-identifying information which matches the identifying column in the supplied data.nThis column is coerced to character-type in the output of this function.

  • dye, giving a unique identifying name for the dye that was present in a well. This column is coerced to character-type in the output of this function.

  • dye concentration, a column giving the concentration (in uM) at which the dye was tested. This column is coerced to numeric-type in the output of this function. uM units are not precisely necessary, but the default name of this column for processing is "dye_conc_uM", so non-uM units may lead to misleading column labeling if default column names are used.

.type

a string, giving either "protein" or "buffer". This value will be present in the output as a column entitled "type", with only this value below.

.variable_col

a string, giving the name of the column containing unique identifying information for each trace of data. For DSF runs in which each well was measured in multiple channels, note that "variable" must comprise both the well and measurement channel to be unique, e.g. "A1_FAM", "A1_JOE", etc. Defaults to "variable".

.temp_col

a string, giving the name of the column containing the temperature data. Defaults to "Temperature".

.value_col

a string, giving the name of the column containing the raw RFU values. Defaults to "value".

.dye_col

a string, giving the name of the column containing the unique dye names. Defaults to "final_compound".

.dye_conc_col

a string, giving the name of the column containing the dye concentrations Defaults to "final_concentration".

.channel_col

a string, giving the name of the column containing the channel information for the measurements. Defaults to "channel_f".

.drop_layout_empties_from

a string, giving the name of the column used to drop any NA-containing rows from the output data. Is passed to .drop_from in label_data, and defaults here to "dye"

.keep_additional_cols

a character vector, giving names of any additional columns that should be retained in the final output beyond the default "variable", "Temperature", "value", "dye_conc_uM", "channel_f", "dye". This argument is useful in cases where additional information from the layout should be carried through the dye screen analysis process.

...

additional arguments, passsed to label_data.

Value

a tibble, containing data for a labeled dye screen, of either protein or buffer. Columns are: #' #'

  • variable, a character column holding unique identifiers for each trace.

  • Temperature,

  • value,

  • value_norm,

  • dye, a character column specifying the dye tested in each well.

  • dye_conc_uM, a character column specifying the concentration at which a dye was tested.

  • channel_f, a character column specifying the channel in which the raw values were measured.

  • type, a character column specifying the type of moleceule screened– "protein" or "buffer". Each output may contain only one value for "type". For plates in which buffer and protein were both screened, layouts which mask the wells of the opposite type should be used, to create separate outputs for each type of data.


taiawu/dsfworld_package documentation built on June 18, 2024, 5:39 a.m.