seqsplt_addr: Split address strings in a data frame into sequential...
In gmculp/rNYCclean: A package to clean NYC addresses

Description Usage Arguments Value Examples

View source: R/seqsplt_addr.R

The seqsplt_addr function splits address strings in a data frame into sequential combinations of words.

1 2	seqsplt_addr(in_df, new_addr_col_name, id_col_name, addr_col_name, third_col_name, remove_orig = TRUE)

`in_df`	a data frame containing addresses. Required.
`new_addr_col_name`	the name of output addresses column as string. Required.
`id_col_name`	the name of the unique identifier column as string. Required.
`addr_col_name`	the name of the input addresses column as string. Required.
`third_col_name`	the name of either the borough code or zip code column as string. Required.
`remove_orig`	option to exclude original address from output as binary. Optional.

A data frame containing id_col_name, third_col_name, and a column of address strings split into sequential combinations of words.

# create a data frame of addresses
ADDR <- c("ROOM 326 125 WORTH STREET","253 BROADWAY FLR 3",
    "C/O DOHMH 42-09 28 STREET")
BORO_CODE <- c(1,1,4)
u_id <- 1:length(ADDR)
df = data.frame(u_id, ADDR, BORO_CODE)

#split address column into sequential combinations
df1 <- seqsplt_addr(in_df = df, new_addr_col_name = "ADDR.seqsplt",
    id_col_name = "u_id", addr_col_name = "ADDR", 
    third_col_name = "BORO_CODE")

#preview records
head(df1)