There is currently no plan to release vcfR 2.0.0. If and when this 'major' release occurs it will include changes that will break backward compatibility. At the present, this is simply a to-do list for ideas to include in the next major release.
gzread
needs us to tell it a maximum number of bytes to read in, so we need to make an arbitrary guess.
I think I encountered a situation where 4-96 was not enough so I've bumped it to 16,384 B.samples
to vcfR method [
.extract.gt()
could be greatly improved with multithreading. While he used mclapply()
I do not feel this is the best solution because it does not work on Windows. I think a better solution would be RCppParallel because this should work on all CRAN platforms.Released on CRAN 2023-12-07 * Ran usethis::use_package_doc()
Released on CRAN 2023-02-10 Compile time 'nodiscard' attribute: changed 'col_vec.size()' to 'sizeof(col_vec)' in ad_frequency.cpp and masplit.cpp vcfR_to_tidy handles no INFO in meta
Released on CRAN 2022-07-16
* Added vcfR2hapmap()
to convert data for use in GAPIT
Released on CRAN 2020-09-01
Added PKGTYPE: both
to appveyor.yml so Windows packages can be built from source
Omitted configure
file that unnecessarily tried to invoke checkbashisms
* Incorporated help from https://stackoverflow.com/a/62721142 to use checkbashisms when checking on Debian flavors of Linux (ended up omitting this change but left this here to document it and the link)
Released on CRAN 2020-06-05 * Now compatible with R 4.0.0 and dplyr 1.0.0
Released on CRAN 2020-02-06 Handled deprecated "dplyr::verb_" function in vcfR2tidy Omitted unused elipses from proc.chromR()
Released on CRAN 2020-01-10
Changed class(x) == "matrix" to inherited(x, "matrix")
Changed license from GPL
to GPL-3
(#144).
extract.haps()
reports the correct number of variants processed when verbose.
The square brackets ([]) handle @gt slots with no samples.
vcfR2loci()
now has the option return.alleles = FALSE
.
vcfR2genind()
now has the option retrun.alleles = FALSE
.
Error handling code moved into the C++ functions called by read.vcfR so that errors are thrown earlier when reading a VCF. read.vcfR no longer checks that a file is readable first, which solves issues sometimes seen with shared files. (Issue #109, reported and fixed by @NikNakk).
extract.haps()
did not include the parameter return.alleles = TRUE
in it's call to extract.gt()
in the haploid branch of the function. This parameter has now been added. This also affects vcfR2DNAbin()
which calls this function.
vcfR2genlight()
includes the parameter ...
to pass parameters to adegenet::df2genind()
.
is.indel()
returns logical vector to identify indels.
* gt.to.popsum now handles genotypes that include some, but not all, missing alleles.
Released on CRAN 2018-04-17
Attempted to address CRAN's 'Note: break used in wrong context: no loop is visible' issue.
.vcf_stats_gz()
reports number of elements in header as well as the file's last line. This is used by read.vcfR()
to check for poorly formed files.
show
method for vcfR now queries @fix instead of @gt.
check_keys()
checks key definitions in the meta section to make sure they are unique.
freq_peak_plot()
has parameter posUnits
to adjust units of scatterplot.
vcfR2migrate()
manual discusses Unix and Windows line endings.
Released on CRAN 2018-02-07.
vcf_field_names()
now delimts on KEY= of key/value pairs, allows commas to be used within value.
read.vcfR()
will download files when provided with a link.
* Added example data from the Variant Effect Predictor (vep) data(vep)
.
Released on CRAN 2017-12-08.
vcf_field_names()
now handles keys that are out of order and multiple optional keys.
vcfR2DNAbin()
can include indels and maintains alignment.
write.vcf()
now handles tilde expansion.
rePOS()
attempts to create a non-overlapping coordinate system from POS and CHROM.
vcfR2DNAbin()
manages the asterisk allele.
extract.indels()
ignores GATK's .
Added support for chromR objects with no gt slot to proc.chromR()
.
Created peak_to_ploid()
to call peaks and calculate dfe from freq_peak()
output.
Created freq_peak_plot()
to help visualize the output of freq_peak()
.
.vcf_stats_gz
now has nrows and skip parameters.
removed .Call()
statements to standardize style.
Created vcfR2migrate()
to output MigrateN format data.
Addressed clang-UBSAN memory leak in freq_peak()
.
Created pairwise_genetic_diff()
to calculate pairwise differentiation. Thanks Javier!
Released on CRAN 2017-05-18.
genetic_diff()
to calculate fixation indicies.pinfsc50.png
to tools.samples
parameter to vcfR method [
.length()
method for chromR objects.[
method throws warning if FORMAT is omitted.plot()
for signature 'chromR' handles INFO column when its all NA.create.chrom()
subsets to first chromosome when more than one is provided.Released on CRAN 2017-01-07.
masplit()
converts '.' to NA.extract.indels()
does not recognize NA as a deletion.getINFO
to getFIX()
to suppress INFO column.Released on CRAN 2016-12-08.
extract.gt()
no longer uses parameter allele.sep()
. create.chromR()
the data are subset to the first chromosome. Thank you Christian!convertNA
parameter to extract.gt()
to allow preservation of VCF encoding of missing data. Thank you Thierry!convertNA
parameter to read.vcfR()
to allow preservation of VCF encoding of missing data. Thank you Thierry!freq_peak()
to find peaks in allele balance frequency data.masplit()
to parse matrices contains delimited strings.ordisample()
to ordinate sample information.extract.gt()
can now use the ID column from the fix region for rownames.INFO2df()
and metaINFO2df()
.Released on CRAN 2016-07-25.
vcfR2genind()
greps genotypes containing a missing allele ('.') and sets to NA.is_het()
rapidly identifies heterozygotes.extract.info()
scores missing elements as NA.Released on CRAN 2016-05-26.
This release includes the incorporation of suggestions made by reviewers of the manuscript submitted to Molecular Ecology Resources.
is.het()
to identify heterozygotes in a matrix of genotypes.vcfR2DNAbin
where a variant one position beyond the locus would attempt to be included but threw an error.vcfR_test
as lightweight test VCF data.AD_frequency
calculates allele frequencies from matrices of AD data.read.vcfR()
handles VCF data with no GT region (ala LoFreq).gt2alleles
handles missing data ('.').read.vcfR()
checks for and removes carriage returns (Windows).vcfR2DNAbin
converts 'NA' to 'n' prior to conversion to DNAbin.chromR2vcfR
implements use.mask.extract.gt()
converts "." to NA.write.vcf()
now uses mask = TRUE.maf()
provides counts and frequency for the minor (or other) allele.create.chromR()
now handles instances with no seq and the annotation position exceeds the greatest VCF POS.read.vcfR()
now handles tilde expansion.addID()
populates the non-missing values in the ID column of VCF data by concatenating the chromosome and position. Released on CRAN 2016-02-22. This release was used to prepare the manuscript that was submitted to Molecular Ecology Resources.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.