json2lucene: Function used for migrating tweets from the old to the new...
In epitweetr: Early Detection of Public Health Threats from 'Twitter' Data

json2lucene

R Documentation

Function used for migrating tweets from the old to the new file system

Description

Migrates geolocated tweets from the old to the new file system, allowing full-text search using Apache Lucene indexes.

Usage

json2lucene(tasks = get_tasks(), chunk_size = 400)

Arguments

`tasks`	named list, current tasks for logging and updating progress, default: get_tasks()
`chunk_size`	integer, the chunk size for indexing tweets, default: 400

Details

This function can be called manually to migrate tweets from v0.0.x to v2+. It iterates over existing tweets collected with the epitweetr v0.0.x series, joins base tweets and geolocated tweets, and then sends them to the Lucene index via the dedicated REST API. Migrated files are moved to the search_archive and geo_archive folders. Users can back up and remove these folders once the migration ends to free disk space. Series folders are maintained for migrated tweets.
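
To illustrate what chunk_size controls, here is a minimal base R sketch of batching items into groups of at most chunk_size before they are sent for indexing. The helper chunk_items and the placeholder tweet_ids are hypothetical names used only for this illustration. They are not part of epitweetr, and the package's internal batching may differ.

```r
# Hypothetical helper (not part of epitweetr): split a vector into
# consecutive chunks containing at most chunk_size elements each.
chunk_items <- function(items, chunk_size = 400L) {
  stopifnot(chunk_size >= 1)
  if (length(items) == 0) return(list())
  # Element i is assigned to group ceiling(i / chunk_size)
  groups <- ceiling(seq_along(items) / chunk_size)
  unname(split(items, groups))
}

# Placeholder data standing in for 1000 tweets (assumption for illustration)
tweet_ids <- seq_len(1000)
chunks <- chunk_items(tweet_ids, chunk_size = 400L)

# Two full chunks and one partial chunk
lengths(chunks)  # 400 400 200
```

Under this scheme a smaller chunk_size means more, smaller batches, while a larger one means fewer, bigger batches.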

Value

The updated tasks.

Examples

if(FALSE){
   library(epitweetr)
   # setting up the data folder
   message('Please choose the epitweetr data directory')
   setup_config(file.choose()) 
   # running the migration
   json2lucene()
}

epitweetr documentation built on Nov. 16, 2023, 5:07 p.m.