paste | R Documentation |
Concatenate (join) the corresponding and/or consecutive elements of given vectors, after converting them to strings.
paste(..., sep = " ", collapse = NULL, recycle0 = FALSE)
paste0(..., sep = "", collapse = NULL, recycle0 = FALSE)
e1 %x+% e2
strcat(x, collapse = "", na.rm = FALSE)
... |
character vectors (or objects coercible to) whose corresponding/consecutive elements are to be concatenated |
sep |
single string; separates terms |
collapse |
single string or |
recycle0 |
single logical value; if |
e1 , e2 |
character vectors (or objects coercible to) whose corresponding elements are to be concatenated |
x |
character vector (or an object coercible to) whose consecutive elements are to be concatenated |
na.rm |
single logical value; if |
`%x+%`
is an operator that concatenates corresponding
strings from two character vectors (and which behaves just like
the arithmetic `+`
operator).
strcat
joins (aggregates based on string concatenation)
consecutive strings in a character vector, possibly with
a specified separator in place, into a single string.
paste
and paste0
, concatenate a number
of vectors using the same separator and then possibly join them into
a single string. We recommend using
`%x+%`
, sprintf
, and strcat
instead
(see below for discussion).
A character vector (in UTF-8).
`%x+%`
preserves object attributes in a similar way as
other Arithmetic operators (however, they may be lost
during as.character(...)
conversion, which is an S3 generic).
strcat
is an aggregation function, therefore it
preserves no attributes whatsoever.
Currently, paste
and paste0
preserve no attributes too.
Replacement for base paste
implemented with stri_join
.
Note that paste
can be thought of as a string counterpart
of both the `+`
operator (actually, some languages do have a binary
operator for string concatenation, e.g., `.`
in Perl and PHP,
`+`
(str.__add__
) in Python; R should have it too,
but does not) which is additionally vectorised ('Map') and the
sum
function ('Reduce').
Therefore, we would expect it to behave similarly with regards
to the propagation of missing values and the preservation of object
attributes, but it does not.
missing values treated as "NA"
strings (it is a well-documented
feature though) [fixed here]
partial recycling with no warning "longer object length is not a multiple of shorter object length" [fixed here]
empty vectors are treated as vectors of empty strings [fixed here]
input objects' attributes are not preserved
[fixed only in `%x+%`
operator]
paste0
multiplies entities without necessity;
sep=""
should be the default in paste
[not fixed]
paste0
treats the named argument sep="..."
as one
more vector to concatenate
[fixed by introducing sep
argument]
overloading `+.character`
has no effect in R, because S3
method dispatch is done internally with hard-coded support for
character arguments. We could have replaced the generic `+`
with the one that calls UseMethod
, but the
dispatch would be done on the type of the first argument anyway
(not to mention it feels like a too intrusive solution).
Actually having a separate operator for concatenation (similar to
PHP's or Perl's `.`
) which always coerces to character
frees the user from manual coercion (is it such a burden on the other
hand?)
[fixed by introducing `%x+%`
operator]
It should also be noted that paste
with collapse=NULL
is a
special case of sprintf
(which is featured in many programming
languages; R's version is of course vectorised).
For instance, paste(x, y, sep=",")
is equivalent to sprintf("%s,%s", x, y)
.
Taking into account the above, paste
and paste0
seem
redundant and hence we mark them as [DEPRECATED].
Here are our recommendations:
the most frequent use case - concatenating corresponding
strings from two character vectors with no separator - is covered
by a new operator `%x+%`
which propagates NAs correctly
and handles object attributes the same way as the built-in arithmetic
operators;
for fancy elementwise (like 'Map') concatenation,
use our version of sprintf
;
for the 'flattening' of consecutive strings in a character vector
(like 'Reduce'), use the new function strcat
.
The official online manual of stringx at https://stringx.gagolewski.com/
Related function(s): strrep
, sprintf
# behaviour of `+` vs. base::paste vs. stringx::paste
x <- structure(c(x=1, y=NA, z=100, w=1000), F="*")
y1 <- structure(c(a=1, b=2, c=3), G="#", F="@")
y2 <- structure(c(a=1, b=2, c=3, d=4), G="#", F="@")
y3 <- structure(1:4, G="#", F="@", dim=c(2, 2), dimnames=list(NULL, c("a", "b")))
x + y1
x + y2
x + y3
y2 + x
base::paste(x, y1)
base::paste(x, y2)
base::paste(x, y3)
stringx::paste(x, y1)
stringx::paste(x, y2)
stringx::paste(x, y3)
base::paste(x, character(0), y2, sep=",")
stringx::paste(x, character(0), y2, sep=",")
x %x+% y1
x %x+% y2
x %x+% y3
y2 %x+% x
x %x+% character(0)
strcat(x, collapse=",")
strcat(x, collapse=",", na.rm=TRUE)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.