partition_data: Partition Data Into Shards
In scalablebayesm: Distributed Markov Chain Monte Carlo for Bayesian Inference in Marketing

partition_data

R Documentation

Partition Data Into Shards

Description

A function to partition data into s shards for use in distributed estimation.

Usage

partition_data(Data, s)

Arguments

`Data`	A list of containing either 'regdata' or 'lgtdata' and 'Z'(optional). If 'Data' contains 'lgtdata', it should also contain 'p' number of choice alternatives.
`s`	The number of shards to partition the data into.

Value

A list of 's' shards where each shard contains:

`p`	(integer) - Number of choice alternatives (only if 'Data' contains 'lgtdata')
`lgtdata or regdata`	(list, length: n) - A list of n elements where each element contains 'X', 'y', 'beta', and 'tau'
`Z`	(Matrix) - A n x nz matrix of units chars. Null if 'Data' does not contain Z [Optional]

Author(s)

Federico Bumbaca, Leeds School of Business, University of Colorado Boulder, federico.bumbaca@colorado.edu

References

Bumbaca, F. (Rico), Misra, S., & Rossi, P. E. (2020). Scalable Target Marketing: Distributed Markov Chain Monte Carlo for Bayesian Hierarchical Models. Journal of Marketing Research, 57(6), 999-1018.

Examples


# Generate hierarchical linear data
R=1000 #number of draws
nreg=2000 #number of observational units
nobs=5 #number of observations per unit
nvar=3 #columns
nz=2

Z=matrix(runif(nreg*nz),ncol=nz) 
Z=t(t(Z)-apply(Z,2,mean))
Delta=matrix(c(1,-1,2,0,1,0), ncol = nz) 
tau0=.1
iota=c(rep(1,nobs)) 

## create arguments for rmixture
tcomps=NULL
a = diag(1, nrow=3)
tcomps[[1]] = list(mu=c(-5,0,0),rooti=a) 
tcomps[[2]] = list(mu=c(5, -5, 2),rooti=a)
tcomps[[3]] = list(mu=c(5,5,-2),rooti=a)
tpvec = c(.33,.33,.34)                               
ncomp=length(tcomps)
regdata=NULL
betas=matrix(double(nreg*nvar),ncol=nvar) 
tind=double(nreg) 
for (reg in 1:nreg) { 
  tempout=bayesm::rmixture(1,tpvec,tcomps)
  if (is.null(Z)){
    betas[reg,]= as.vector(tempout$x)  
  }else{
    betas[reg,]=Delta%*%Z[reg,]+as.vector(tempout$x)} 
  tind[reg]=tempout$z
  X=cbind(iota,matrix(runif(nobs*(nvar-1)),ncol=(nvar-1))) 
  tau=tau0*runif(1,min=0.5,max=1) 
  y=X%*%betas[reg,]+sqrt(tau)*rnorm(nobs)
  regdata[[reg]]=list(y=y,X=X,beta=betas[reg,],tau=tau) 
}

Prior1=list(ncomp=ncomp) 
keep=1
Mcmc1=list(R=R,keep=keep)
Data1=list(list(regdata=regdata,Z=Z))

length(Data1)

Data2 = partition_data(Data1, s = 3)
length(Data2)

scalablebayesm documentation built on April 3, 2025, 7:55 p.m.

scalablebayesm index

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

scalablebayesm
Distributed Markov Chain Monte Carlo for Bayesian Inference in Marketing

partition_data: Partition Data Into Shards
In scalablebayesm: Distributed Markov Chain Monte Carlo for Bayesian Inference in Marketing

Partition Data Into Shards

Description

Usage

Arguments

Value

Author(s)

References

Examples

Related to partition_data in scalablebayesm...

R Package Documentation

Browse R Packages

We want your feedback!

scalablebayesm Distributed Markov Chain Monte Carlo for Bayesian Inference in Marketing

partition_data: Partition Data Into Shards In scalablebayesm: Distributed Markov Chain Monte Carlo for Bayesian Inference in Marketing

Partition Data Into Shards

Description

Usage

Arguments

Value

Author(s)

References

Examples

Related to partition_data in scalablebayesm...

R Package Documentation

Browse R Packages

We want your feedback!

scalablebayesm
Distributed Markov Chain Monte Carlo for Bayesian Inference in Marketing

partition_data: Partition Data Into Shards
In scalablebayesm: Distributed Markov Chain Monte Carlo for Bayesian Inference in Marketing