knitr::opts_chunk$set( collapse = TRUE, comment = "#>" )
We recommend installing the stable, peer-reviewed version of TruncExpFam, available on CRAN:
install.packages("TruncExpFam")
After successful installation, the package can be loaded with
library(TruncExpFam)
TruncExpFam comes equipped with functions to generate random samples from no
less than 12 different probability distributions from the truncated exponential
family. You can read more about them by running ?rtrunc
on your R console.
As an example, we will sample 100 values from a chi-square distribution with 14 degrees of freedom:
x <- rtrunc(100, family = "chisq", df = 14)
By default, however, rtrunc()
doesn't generate a truncated distribution. As
a matter of fact, the code above will generate the exact same sample as if
were drawn from stats::rchisq()
, watch:
set.seed(3067) x2 <- rtrunc(20, "chisq", df = 14) set.seed(3067) x3 <- rchisq(20, 14) identical(x2, x3)
Oh, wait... Those objects are supposed to be identical! What happened? Let's investigate:
x2 x3 str(x2) str(x3) class(x2) class(x3)
OK, so you can tell that the generated numbers are the same, but x2
and x3
are not literally the same objects because the former has a different class.
These trunc_*
classes are actually very special, because they contain some
extra information about the distribution that a simple vector does not. One
can access such information using print(x2, details = TRUE)
:
print(x2, details = TRUE)
Just to be sure that the sample itself matches:
identical(as.vector(x2), x3)
Speaking of alternative ways to generate the same sample, for the sake of
convenience and of users familiar with the sampling functions
from the stats package, the wrapper function rtruncchisq()
is also available.
The results, as you can see below, are identical:
set.seed(2912) x4 <- rtrunc(1e4, "chisq", df = 14) set.seed(2912) x5 <- rtruncchisq(1e4, df = 14) identical(x4, x5)
So far, all samples generated are actually not truncated. This is because,
by default, the truncation limits a
and b
are set to the limits of the
distribution support, which are 0 and Inf for the chi-squared distribution.
Let us use a simpler distribution for this second example by sampling from Poisson(10):
y1 <- rtruncpois(1e4, 10) summary(y1) var(y1)
As expected, the values are all larger than 0 and the mean and variance are 10. If we wanted to generate instead from a Poisson(10) truncated at, say, 8 and 20, we would run:
y2 <- rtruncpois(1e4, 10, a = 9, b = 20) summary(y2) var(y2)
Notice how, even with a large sample, the observed mean and variance are still quite far from 10.
One reliable method of estimating the original lambda used to generate y2
is
by running the mlEstimationTruncDist()
function:
lambda <- mlEstimationTruncDist(y2, print.iter = TRUE) lambda
More information about that function and how you can tweak it is available on
?mlEstimationTruncDist
.
Any scripts or data that you put into this service are public.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.