utils_data_org: Utilities for data organization

utils_data_org    R Documentation

Utilities for data organization

Description

[Experimental]

Useful functions for organizing data before statistical analysis:

  • add_seq_block(): Add a column with sequential block numbering to a multi-environment data set.

  • recode_factor(): Recode a factor column. Each level is identified by a sequential number, optionally preceded by a prefix.

  • df_to_selegen_54(): Given multi-environment data with environment, genotype, and replication columns, format the data for use in the Selegen software (model 54).
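As a rough illustration of what sequential block numbering means, the base R sketch below gives each environment-by-replication combination its own number across the whole data set. It is not the metan implementation: the data frame `toy` and the helper `seq_block_sketch()` are hypothetical names, and numbering the combinations in order of first appearance is an assumption made for the example.

```r
# Hypothetical toy data: 2 environments x 2 replications x 2 genotypes
toy <- data.frame(
  ENV = rep(c("E1", "E2"), each = 4),
  REP = rep(rep(c("1", "2"), each = 2), times = 2),
  GEN = rep(c("G1", "G2"), times = 4),
  stringsAsFactors = FALSE
)

# Hypothetical helper (not part of metan): give every ENV x REP
# combination its own sequential number, in order of first appearance
# (an assumption), and prepend the optional prefix.
seq_block_sketch <- function(data, env, rep, prefix = "") {
  combo <- paste(data[[env]], data[[rep]], sep = "_")
  data$BLOCK <- paste0(prefix, match(combo, unique(combo)))
  data
}

toy_blocked <- seq_block_sketch(toy, "ENV", "REP", prefix = "B")

# One row per ENV x REP combination with its block label
unique(toy_blocked[, c("ENV", "REP", "BLOCK")])
```

With replications labelled 1 and 2 inside each environment, the new column distinguishes replication 1 of the first environment from replication 1 of the second, which is the point of numbering blocks over environments.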

Usage

add_seq_block(data, env, rep, new_factor = BLOCK, prefix = "", verbose = TRUE)

recode_factor(data, factor, new_factor = CODE, prefix = "", verbose = TRUE)

df_to_selegen_54(data, env, gen, rep, verbose = TRUE)

Arguments

data

A data frame.

env

The name of the column that contains the levels of the environments.

rep

The name of the column that contains the levels of the replications/blocks.

new_factor

The name of the new column created.

prefix

An optional prefix to combine with the sequential numbers of the new factor.

verbose

Logical argument. If verbose = FALSE the code will run silently.

factor

The name of the column to recode.

gen

The name of the column that contains the levels of the genotypes, which will be treated as a random effect.

Author(s)

Tiago Olivoto tiagoolivoto@gmail.com

References

Resende, M.D.V. 2016. Software Selegen-REML/BLUP: a useful tool for plant breeding. Crop Breed. Appl. Biotechnol. 16(4): 330–339. doi: 10.1590/1984-70332016v16n4a49.

Examples


library(metan)
df_ge <- ge_simula(ngen = 2,
                   nenv = 3,
                   nrep = 2) %>%
         add_cols(ENV = c(rep("CACIQUE", 4),
                          rep("FREDERICO", 4),
                          rep("SANTA_MARIA", 4)))
df_ge

# Add sequential block numbering over environments
add_seq_block(df_ge, ENV, REP, prefix = "B")

# Recode the 'ENV' column to "ENV1", "ENV2", and so on.
recode_factor(df_ge,
              factor = ENV,
              prefix = "ENV",
              new_factor = ENV_CODE)

# Format the data to be used in the Selegen software (model 54)
df <- df_to_selegen_54(df_ge, ENV, GEN, REP) %>%
  recode_factor(ENV, prefix = "E", new_factor = ENV)
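
The recoding shown in the examples (environment names becoming "ENV1", "ENV2", and so on) can be approximated in base R as follows. This is only a sketch of the idea, not the metan code: `recode_sketch()` is a hypothetical helper, and assigning numbers in order of first appearance is an assumption.

```r
# Hypothetical helper (not part of metan): replace each level of a
# vector with prefix + sequential number, in order of first appearance.
recode_sketch <- function(x, prefix = "") {
  paste0(prefix, match(x, unique(x)))
}

# Same environment labels as in the example data above
env <- c(rep("CACIQUE", 4), rep("FREDERICO", 4), rep("SANTA_MARIA", 4))
env_code <- recode_sketch(env, prefix = "ENV")

# Cross-tabulate original labels against the new codes
table(env, env_code)
```

Each original label maps to exactly one code, so the cross-tabulation has a single non-zero cell per row.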



metan documentation built on March 7, 2023, 5:34 p.m.