bracketX: Bracket Parsing

Description Usage Arguments Value Warning Author(s) References See Also Examples

Description

bracketX - Apply bracket removal to character vectors.

bracketXtract - Apply bracket extraction to character vectors.

genX - Apply general chunk extraction to character vectors.

genXtract - Apply general chunk removal to character vectors.

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
  bracketX(text.var, bracket = "all", missing = NULL,
    names = FALSE)

  bracketXtract(text.var, bracket = "all", with = FALSE,
    merge = TRUE)

  genX(text.var, left, right, missing = NULL,
    names = FALSE)

  genXtract(text.var, left, right, with = FALSE,
    merge = TRUE)

Arguments

text.var

The text variable

bracket

The type of bracket (and encased text) to remove. This is one or more of the strings "curly", "square", "round", "angle" and "all". These strings correspond to: {, [, (, < or all four types.

missing

Value to assign to empty cells.

names

logical. If TRUE the sentences are given as the names of the counts.

with

logical. If TRUE returns the brackets and the bracketed text.

merge

logical. If TRUE the results of each bracket type will be merged by sentence. FALSE returns a named list of lists of vectors of bracketed text per bracket type.

left

A vector of character or numeric symbols as the left edge to extract.

right

A vector of character or numeric symbols as the right edge to extract.

Value

bracketX - returns a vector of text with brackets removed.

bracketXtract - returns a list of vectors of bracketed text.

genXtract - returns a vector of text with checks removed.

genX - returns a list of vectors of removed text.

Warning

the gen functions are more flexible than the bracket functions but are not compatible with special regex characters. It is recommended that the researcher use the bracket functions.

Author(s)

Martin Morgan and Tyler Rinker <tyler.rinker@gmail.com>.

References

http://stackoverflow.com/questions/8621066/remove-text-inside-brackets-parens-and-or-braces

See Also

regex

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
examp <- structure(list(person = structure(c(1L, 2L, 1L, 3L),
    .Label = c("bob", "greg", "sue"), class = "factor"), text =
    c("I love chicken [unintelligible]!",
    "Me too! (laughter) It's so good.[interrupting]",
    "Yep it's awesome {reading}.", "Agreed. {is so much fun}")), .Names =
    c("person", "text"), row.names = c(NA, -4L), class = "data.frame")

examp
bracketX(examp$text, "square")
bracketX(examp$text, "curly")
bracketX(examp$text, c("square", "round"))
bracketX(examp$text)


bracketXtract(examp$text, "square")
bracketXtract(examp$text, "curly")
bracketXtract(examp$text, c("square", "round"))
bracketXtract(examp$text, c("square", "round"), merge = FALSE)
bracketXtract(examp$text)
bracketXtract(examp$text, with = TRUE)

paste2(bracketXtract(examp$text, "curly"), " ")

x <- c("Where is the /big dog#?",
    "I think he's @arunning@b with /little cat#.")
genXtract(x, c("/", "@a"), c("#", "@b"))

x <- c("Where is the L1big dogL2?",
    "I think he's 98running99 with L1little catL2.")
genXtract(x, c("L1", 98), c("L2", 99))

DATA$state  #notice number 1 and 10
genX(DATA$state, c("is", "we"), c("too", "on"))

trinker/qdap2 documentation built on May 31, 2019, 9:47 p.m.