authors_clean: Separates author information in references files from...
In ropensci/refsplitr: author name disambiguation, author georeferencing, and mapping of coauthorship networks with 'Web of Science' data

authors_clean

R Documentation

Separates author information in references files from `references_read`

Description

This function takes the output from references_read and cleans the author information.

Usage

authors_clean(references)

Arguments

references

output from references_read

Details

Information on addresses, emails, ORCIDs, etc. is matched to authors.

The function then attempts to group entries for the same author into likely author groups based on shared full names, addresses, emails, ORCIDs, etc.

For records that are not matched this way, a Jaro-Winkler similarity metric is calculated for all possible matching author names.

This metric scores how similar two names are based on the characters they share and how far apart those shared characters sit in each name.
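
To make the similarity metric concrete, here is a minimal base-R sketch of the standard Jaro-Winkler calculation. It is an illustration only and not necessarily how refsplitr computes the score internally. The function names `jaro` and `jaro_winkler` and the prefix weight `p = 0.1` are assumptions made for this example.

```r
# Jaro similarity between two strings (base R only, illustrative).
jaro <- function(a, b) {
  a <- strsplit(a, "")[[1]]
  b <- strsplit(b, "")[[1]]
  la <- length(a)
  lb <- length(b)
  if (la == 0 && lb == 0) return(1)
  if (la == 0 || lb == 0) return(0)
  # Characters only count as matching if they sit within this many
  # positions of each other.
  window <- max(0, floor(max(la, lb) / 2) - 1)
  a_hit <- logical(la)
  b_hit <- logical(lb)
  for (i in seq_len(la)) {
    lo <- max(1, i - window)
    hi <- min(lb, i + window)
    if (lo > hi) next
    for (j in lo:hi) {
      if (!b_hit[j] && a[i] == b[j]) {
        a_hit[i] <- TRUE
        b_hit[j] <- TRUE
        break
      }
    }
  }
  m <- sum(a_hit)
  if (m == 0) return(0)
  # Matched characters that appear in a different order are transpositions.
  t <- sum(a[a_hit] != b[b_hit]) / 2
  (m / la + m / lb + (m - t) / m) / 3
}

# Jaro-Winkler: Jaro similarity plus a bonus for a shared prefix
# of up to four characters. p = 0.1 is the conventional weight (assumed here).
jaro_winkler <- function(a, b, p = 0.1) {
  sim <- jaro(a, b)
  ca <- strsplit(a, "")[[1]]
  cb <- strsplit(b, "")[[1]]
  n <- min(4, length(ca), length(cb))
  prefix <- 0
  for (k in seq_len(n)) {
    if (ca[k] != cb[k]) break
    prefix <- prefix + 1
  }
  sim + prefix * p * (1 - sim)
}

# Example: compare two spellings of a (made-up) author name.
jaro_winkler("Smith, John", "Smith, Jon")
```

Scores range from 0 (no shared characters) to 1 (identical strings), so higher values indicate names that are more likely to belong to the same author.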

Examples

## Load the refsplitr sample dataset "BITR" 
data(BITR) 
BITR_clean <- authors_clean(BITR)

## The output of authors_clean is a list with two elements, 
## which can be assigned to dataframes.
BITR_review_df <- BITR_clean$review
BITR_prelim_df <- BITR_clean$prelim

## Users can save these dataframes outside of R as .csv files.
## The "review_df.csv" is then used to review the groupID or authorID 
## assignments and make any necessary corrections. 
## The function "authors_refine" is used to load and merge the changes 
## into R and create a dataframe used for analyses.
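
The save step described in the comments above could look roughly like the following sketch. To keep it runnable without the package, it uses a small stand-in data frame. The `groupID` and `authorID` columns are named in the comments above, but the `author_name` column, the values, and the temporary file path are all made up for illustration. In practice the data frame would be the `review` element returned by authors_clean.

```r
# Stand-in for BITR_clean$review (hypothetical columns and values).
review_df <- data.frame(
  authorID = c(1L, 2L, 3L),
  groupID = c(1L, 1L, 3L),
  author_name = c("Smith, J", "Smith, John", "Lee, A"),
  stringsAsFactors = FALSE
)

# Write the review table to a .csv file so it can be checked by hand.
review_path <- file.path(tempdir(), "review_df.csv")
write.csv(review_df, review_path, row.names = FALSE)

# After editing the file outside of R, read the corrected table back in.
review_back <- read.csv(review_path, stringsAsFactors = FALSE)
```

Setting `row.names = FALSE` keeps R from writing an extra index column, so the file has the same columns when it is read back.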

ropensci/refsplitr documentation built on June 9, 2025, 3:51 a.m.
