Pixsets: representing pixel sets in imager

knitr::opts_chunk$set(warning=FALSE, message=FALSE, cache=FALSE, 
               comment=NA, verbose=TRUE, fig.width=5, fig.height=5,dev='jpeg',dev.args=list(quality=50))

If you've already gone through the "Getting started" vignette, you know about image objects ("cimg" class). This vignette introduces pixsets, which is the other important kind of objects. Pixsets represent sets of pixels, or equivalently binary images, AKA "masks".

From images to pixsets and back

A pixset is what you get if you run a test on an image:

im <- load.example('parrots') %>% grayscale
px <- im > .6 #Select pixels with high luminance

Internally, a pixel set is just an array of logical (boolean) values:


The "TRUE" values correspond to pixels in the set, and "FALSE" to pixels not in the set. The dimensions of the pixset are the same as that of the original image:

all(dim(px) == dim(im))

To count the number of pixels in the set, use sum:

sum(px) #Number of pixels in set
mean(px) #Proportion

Converting a pixset to an image results in an image of the same size with zeroes and ones:

##same thing: automatic conversion to a numeric type
px + 0

Indexing using pixsets

You can use pixsets the same way you'd normally use an array of logicals, e.g. for indexing:

which(px) %>% head

Plotting and visualising pixsets

The "highlight" function is a good way of visualising pixel sets:

px <- (isoblur(im,4)  > .5 )

highlight extracts the contours of the pixset (see ?contours) and plots them.

You can also use plain old "plot":


It converts "px" to an image and uses plot.cimg.

Coordinates for pixels in pixsets

The where function returns coordinates for pixels in the set:

where(px) %>% head

where returns a data.frame. That format is especially convenient if you want to compute some statistics on the coordinates, e.g., the center of mass of a region defined by a pixset:

where(px) %>% dplyr::summarise(mx=mean(x),my=mean(y))

Selecting contiguous regions, splitting into contiguous regions

In segmentation problems one usually wants contiguous regions: px.flood uses the flood fill algorithm (AKA the bucket tool in image editors) to select pixels based on similarity.

#Start the fill at location (180,274). sigma sets the tolerance
px.flood(im,180,274,sigma=.21) %>% highlight

It's also common to want to split a pixset into contiguous regions: use split_connected.

sp <- split_connected(px) #returns an imlist 

Each element in the list is a connected pixset. You can use split_connected to check connectedness (there are faster ways, of course, but this is simple):

is.connected <- function(px) length(split_connected(px)) == 1

Use the "high_connectivity" argument to extend to diagonal neighbours as well. See ?label for more.


The boundary function computes the boundaries of the set:

boundary(px) %>% plot
##Make your own highlight function:
boundary(px) %>% where %$% { points(x,y,cex=.1,col="red") }

Growing, shrinking, morphological operations

The grow and shrink operators let you grow and shrink pixsets using morphological operators (dilation and erosion, resp.). Have a look at the article on (morphology)[https://dahtah.github.io/imager/morphology.html] for more:

#Grow by 5 pixels
grow(px,5) %>% highlight(col="green")
#Shrink by 5 pixels
shrink(px,5) %>% highlight(col="blue")

#Compute bounding box
bbox(px) %>% highlight(col="yellow")

Common pixsets

There's a few convenience functions defining convenience pixsets:

px.none(im) #No pixels
px.all(im) #All of them

#Image borders at depth 10
px.borders(im,10) %>% highlight
#Left-hand border (5 pixels), see also px.top, px.bottom, etc.
px.left(im,5) %>% highlight(col="green")

Splitting and concatenating pixsets

imsplit and imappend both work on pixsets.

#Split pixset in two along x
imsplit(px,"x",2) %>% plot(layout="row")
#Splitting pixsets results into a list of pixsets
imsplit(px,"x",2) %>% str

#Cut along y, append along x
imsplit(px,"y",2) %>% imappend("x") %>% plot()

You can use reductions the same way you'd use them on images, which is especially useful when working with colour images.

Working with colour images

Be careful: each colour channel is treated as having its own set of pixels, so that a colour pixset has the same dimension as the image it originated from, e.g.:

px <- boats > .8
where(px) %>% head

Here "px" tells us the location in all locations and across channels of pixels with values higher than .8. If you plot it you'll see the following:


Red dots correspond to pixels in the red channel, green in the green channel, etc. You can also view the set by splitting:

imsplit(px,"c") %>% plot

If you need to find the pixel locations that have a value of .9 in all channels, use a reduction:

#parall stands for "parallel-all", and works similarly to parmax, parmin, etc.
imsplit(px,"c") %>% parall %>% where %>% head

#at each location, test if any channel is in px
imsplit(px,"c") %>% parany %>% where %>% head

#highlight the set (unsurprisingly, it's mostly white pixels)
imsplit(px,"c") %>% parany %>% highlight

An example: segmentation with pixsets

The following example is derived from the documentation for scikit-image. The objective is to segment the coins from the background.

im <- load.example("coins")

A simple thresholding doesn't work because the illumination varies:

threshold(im) %>% plot

It's possible to correct the illumination using a linear model:

d <- as.data.frame(im)
##Subsamble, fit a linear model
m <- sample_n(d,1e4) %>% lm(value ~ x*y,data=.) 
##Correct by removing the trend
im.c <- im-predict(m,d)
out <- threshold(im.c)

Although that's much better we need to clean this up a bit:

out <- clean(out,3) %>% imager::fill(7)

Another approach is to use a watershed. We start from seeds regions representing known foreground and known background pixels:

bg <- (!threshold(im.c,"10%"))
fg <- (threshold(im.c,"90%"))
imlist(fg,bg) %>% plot(layout="row")
#Build a seed image where fg pixels have value 2, bg 1, and the rest are 0
seed <- bg+2*fg

The watershed transform will propagate background and foreground labels to neighbouring pixels, according to a priority map (the lower the priority, the slower the propagation). Using the priority map it's possible to prevent label propagation across image edges:

edges <- imgradient(im,"xy") %>% enorm
p <- 1/(1+edges)

We run the watershed transform:

ws <- (watershed(seed,p)==1)

We still need to fill in some holes and remove a spurious area. To fill in holes, we use a bucket fill on the background starting from the top-left corner:

ws <- bucketfill(ws,1,1,color=2) %>% {!( . == 2) }

To remove the spurious area one possibility is to use "clean":

clean(ws,5) %>% plot

Another is to split the pixset into connected components, and remove ones with small areas:

split_connected(ws) %>% purrr::discard(~ sum(.) < 100) %>%
    parany %>% plot

Here's a comparison of the segmentations obtained using the two methods:


out2 <- clean(ws,5)

Try the imager package in your browser

Any scripts or data that you put into this service are public.

imager documentation built on May 29, 2017, 8:46 p.m.