netwrite: Network Cleaning and Variable Calculation ('netwrite')
In ideanet: Integrating Data Exchange and Analysis for Networks ('ideanet')

netwrite

R Documentation

Description

The netwrite function reads in relational data in several formats and processes it into a set of standardized outputs. These outputs include commonly calculated measures at the individual node and network-wide levels.

Usage

netwrite(
  data_type = c("edgelist"),
  adjacency_matrix = FALSE,
  adjacency_list = FALSE,
  nodelist = FALSE,
  node_id = NULL,
  node_netid = NULL,
  edgelist = FALSE,
  i_elements = FALSE,
  j_elements = FALSE,
  edge_netid = NULL,
  fix_nodelist = TRUE,
  weights = NULL,
  type = NULL,
  remove_loops = FALSE,
  missing_code = 99999,
  weight_type = "frequency",
  directed = FALSE,
  net_name = "network",
  shiny = FALSE,
  output = c("graph", "largest_bi_component", "largest_component", "node_measure_plot",
    "nodelist", "edgelist", "system_level_measures", "system_measure_plot"),
  message = TRUE
)

Arguments

`data_type`	A character value indicating the type of relational data being entered into `netwrite`. Available options are `"edgelist"`, `"adjacency_matrix"`, and `"adjacency_list"`.
`adjacency_matrix`	If `data_type` is set to `"adjacency_matrix"`, a matrix object containing the adjacency matrix for the network being processed.
`adjacency_list`	If `data_type` is set to `"adjacency_list"`, a data frame containing the adjacency list for the network being processed.
`nodelist`	Either a vector of values indicating unique node/vertex IDs, or a data frame including all information about nodes in the network. If the latter, a value for `node_id` must be specified.
`node_id`	If a data frame is entered for the `nodelist` argument, `node_id` should be a character value indicating the name of the column in the node-level data frame containing unique node identifiers.
`node_netid`	If a data frame is entered for the `nodelist` argument, `node_netid` should be a character value indicating the name of the column in the node-level data frame containing unique network identifiers. This argument should be specified if a value is given for `edge_netid`.
`edgelist`	A data frame including all ties in the network. If this argument is specified, `i_elements`, `j_elements`, `edge_netid` (if applicable), `weights` (if applicable), and `type` (if applicable) must be specified as single character values indicating the names of their respective columns.
`i_elements`	If `data_type` is set to `"edgelist"`, a vector of identifiers indicating the senders of ties in the edgelist, or a single character value indicating the name of the column in `edgelist` containing these identifiers.
`j_elements`	If `data_type` is set to `"edgelist"`, a vector of identifiers indicating the receivers of ties in the edgelist, or a single character value indicating the name of the column in `edgelist` containing these identifiers.
`edge_netid`	If `data_type` is set to `"edgelist"`, a vector of identifiers indicating the specific network to which a particular tie in the edgelist belongs, or a single character value indicating the name of the column in `edgelist` containing network identifiers.
`fix_nodelist`	If `data_type` is set to `"edgelist"` and the user supplies a vector or data frame to `nodelist`, a logical value indicating whether node IDs that appear in the edgelist but not in the nodelist should be added to the nodelist used when processing network data. The usage signature above gives a default of `TRUE`; setting `fix_nodelist` to `FALSE` instead surfaces potential inconsistencies between the nodelist and edgelist to the user.
`weights`	If `data_type` is set to `"edgelist"`, a numeric vector indicating the weight of ties in the edgelist, or a single character value indicating the name of the column in `edgelist` containing tie weights. `netwrite` requires that all edge weights be positive values.
`type`	If `data_type` is set to `"edgelist"`, a numeric or character vector indicating the types of relationships represented in the edgelist, or a single character value indicating the name of the column in `edgelist` containing tie types. If `type` is specified, `netwrite` will treat network(s) as multi-relational and produce additional outputs reflecting the different types of ties appearing in the data.
`remove_loops`	A logical value indicating whether "self-loops" (ties directed toward oneself) should be removed from the network being processed. When `FALSE` (the default), self-loops are kept as valid ties.
`missing_code`	A numeric value indicating "missing" values in an edgelist. Such "missing" values are sometimes included to identify the presence of isolated nodes in an edgelist when a corresponding nodelist is unavailable.
`weight_type`	A character value indicating whether edge weights should be treated as frequencies or distances. Available options are `"frequency"`, indicating that higher values represent stronger ties, and `"distance"`, indicating that higher values represent weaker ties. Note: some underlying functions assume that edge weights represent distances. If `weight_type` is set to `"frequency"`, these functions will use the reciprocal of `weights` as distance values in calculation.
`directed`	A logical value indicating whether edges should be treated as directed or undirected when constructing the network.
`net_name`	A character value indicating the name that network/igraph objects should be given.
`shiny`	A logical value indicating whether `netwrite` is being used in conjunction with IDEANet's Shiny-based visualization app. `shiny` should also be set to `TRUE` when using `ideanet` in an R Markdown file that users expect to knit into a document.
`output`	A character vector indicating the kinds of objects `netwrite` should return. `netwrite` produces several outputs, not all of which every user will need. Users can specify which outputs they want in order to minimize the number of objects returned (and, if the returned list is unpacked there, the number of objects appearing in the global environment). Potential outputs include igraph object(s) (`"graph"`), subgraph(s) of only nodes that appear in the largest component and/or bicomponent of the network (`"largest_component"`, `"largest_bi_component"`), data frame(s) containing node-level measures (`"nodelist"`), a processed edgelist of the network (`"edgelist"`), a data frame indicating network-level summaries (`"system_level_measures"`), and summary visualizations for node- and network-level measures (`"node_measure_plot"`, `"system_measure_plot"`).
`message`	A logical value indicating whether warning messages should be displayed in the R console during processing.
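
Several of the arguments above (`fix_nodelist`, `remove_loops`, `missing_code`, `weights`, `weight_type`) describe properties of the input data that can be inspected before calling `netwrite`. The following is a minimal base R sketch of such checks on a toy edgelist. It is not `netwrite`'s internal code; the objects `nodes`, `edges`, and every other name in it are hypothetical and chosen only for illustration.

```r
# Toy inputs (hypothetical; not a dataset shipped with ideanet)
nodes <- data.frame(id = c(1, 2, 3, 4))
edges <- data.frame(from   = c(1, 2, 3, 5, 4),
                    to     = c(2, 3, 3, 1, 99999),
                    weight = c(1, 2, 4, 1, 1))
missing_code <- 99999

# Rows using the missing code flag isolated nodes rather than real ties
is_placeholder <- edges$from == missing_code | edges$to == missing_code
real_edges <- edges[!is_placeholder, ]

# IDs present in the edgelist but absent from the nodelist:
# the inconsistency that `fix_nodelist` is concerned with
edge_ids <- unique(c(real_edges$from, real_edges$to))
unlisted_ids <- setdiff(edge_ids, nodes$id)

# Ties from a node to itself: the rows that `remove_loops` is concerned with
self_loops <- real_edges[real_edges$from == real_edges$to, ]

# The documentation requires all edge weights to be positive
stopifnot(all(real_edges$weight > 0))

# With frequency weights, distance-based routines are described as using
# the reciprocal of the weights; this shows that conversion by hand
real_edges$distance <- 1 / real_edges$weight

unlisted_ids
self_loops
real_edges
```

Here node 5 appears only in the edgelist, the tie from node 3 to itself is a self-loop, and the row containing 99999 marks node 4 as an isolate.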

Value

netwrite returns a list containing several output objects. Users may find it easier to access and work with outputs by applying list2env to this list, which will separate outputs and store them in the R Global Environment. Note, however, that this risks overwriting existing objects in the Global Environment should those objects share names with objects in netwrite's output. Depending on the values assigned to the output argument, netwrite will produce any or all of the following:

If output contains "graph", netwrite will return an igraph object of the network represented in the original data. If a vector is entered into the type argument, netwrite also produces a list containing igraph objects for each unique relation type as well as the overall network. These output objects are named according to the value specified in the net_name argument.

If output contains "nodelist", netwrite will return a data frame containing individual-level information, including a set of frequently used node-level measures, for each node in the network. If a vector is entered into the type argument, netwrite will produce these node-level measures for each unique relation type.

If output contains "edgelist", netwrite will return a formatted edgelist for the network represented in the original data. If a vector is entered into the type argument, netwrite also produces a list containing edgelists for each unique relation type as well as the overall network.

If output contains "system_level_measures", netwrite will return a data frame providing network-level summary information.

If output contains "node_measure_plot", netwrite will return a plot summarizing the distribution of frequently used node-level measures across all nodes in the network. If a vector is entered into the type argument, netwrite also produces a list containing node-level summary plots for each unique relation type as well as the overall network.

If output contains "system_measure_plot", netwrite will return a plot summarizing the distribution of frequently used network-level measures. If a vector is entered into the type argument, netwrite also produces a list containing network-level summary plots for each unique relation type as well as the overall network.

If output contains "largest_bi_component", netwrite will return an igraph object of the largest bicomponent in the network represented in the original data. If a vector is entered into the type argument, netwrite also produces a list containing the largest bicomponent for each unique relation type as well as the overall network.

If output contains "largest_component", netwrite will return an igraph object of the largest component in the network represented in the original data. If a vector is entered into the type argument, netwrite also produces a list containing the largest component for each unique relation type as well as the overall network.

If users are working with data containing multiple independent networks, netwrite will return a list containing the above outputs for each network in their data, provided that users have passed a vector of network identifiers to the edge_netid argument. Each network's output will be labeled according to its corresponding value in edge_netid.
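
The caution above about list2env overwriting existing objects can be illustrated with a small base R sketch. The list `toy_output` below is a hypothetical stand-in for a netwrite result, and `target` stands in for the Global Environment so that the example does not touch the real one; in practice the target would be `globalenv()`.

```r
# Hypothetical stand-in for the list returned by netwrite
toy_output <- list(
  edgelist = data.frame(from = 1:2, to = 2:3),
  system_level_measures = data.frame(measure = "nodes", value = 3)
)

# `target` plays the role of the Global Environment in this sketch
target <- new.env()
assign("edgelist", "an object that already existed", envir = target)

# Check for name clashes before unpacking
clashes <- intersect(names(toy_output), ls(target))
clashes

# Unpacking replaces the pre-existing `edgelist` without any warning
list2env(toy_output, envir = target)
class(get("edgelist", envir = target))

# A safer alternative: unpack into a dedicated environment
nw_env <- list2env(toy_output, envir = new.env())
ls(nw_env)
```

Keeping outputs in the returned list, or in a dedicated environment as in the last step, avoids the clash entirely.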

Examples

# Use netwrite on an edgelist
nw_fauxmesa <- netwrite(nodelist = fauxmesa_nodes,
                        node_id = "id",
                        i_elements = fauxmesa_edges$from,
                        j_elements = fauxmesa_edges$to,
                        directed = TRUE,
                        net_name = "faux_mesa")

### Inspect updated edgelist
head(nw_fauxmesa$edgelist)

### Inspect data frame of node-level measures
head(nw_fauxmesa$node_measures)

### Inspect system-level summary
head(nw_fauxmesa$system_level_measures)

### Plot sociogram of network
plot(nw_fauxmesa$faux_mesa)

### View node-level summary visualization
nw_fauxmesa$node_measure_plot

### View system-level summary visualization
nw_fauxmesa$system_measure_plot



# Run netwrite on an adjacency matrix

nw_triad <- netwrite(data_type = "adjacency_matrix",
                     adjacency_matrix = triad,
                     directed = TRUE,
                     net_name = "triad_igraph")
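
For readers unsure how the adjacency matrix and edgelist formats relate, the following base R sketch builds a small directed adjacency matrix and converts it to sender/receiver form. The matrix `toy_adj` is hypothetical (it is not the `triad` object used above), and treating rows as senders and columns as receivers is an assumption of this sketch rather than something stated in this documentation.

```r
# Hypothetical 3-node directed network: a -> b, b -> c, c -> a
ids <- c("a", "b", "c")
toy_adj <- matrix(c(0, 1, 0,
                    0, 0, 1,
                    1, 0, 0),
                  nrow = 3, byrow = TRUE,
                  dimnames = list(ids, ids))

# Locate non-zero cells; assumed convention: row = sender, column = receiver
ties <- which(toy_adj != 0, arr.ind = TRUE)

# Build the equivalent edgelist, carrying cell values along as tie weights
toy_edges <- data.frame(from   = rownames(toy_adj)[ties[, "row"]],
                        to     = colnames(toy_adj)[ties[, "col"]],
                        weight = toy_adj[ties],
                        stringsAsFactors = FALSE)
toy_edges
```

The `from` and `to` columns of `toy_edges` have the same shape as the vectors passed to `i_elements` and `j_elements` in the edgelist example.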

ideanet documentation built on June 8, 2025, 1:07 p.m.