knitr::opts_chunk$set( collapse = TRUE, comment = "#>" )
library(qryflow)
qryflow
was designed to be easily extended and allows users to define custom chunk types. This vignette provides relevant background knowledge on how qryflow
works under the hood, then walks through how to create and register custom chunk types.
This vignette assumes the knowledge found in the "Getting Started" (vignette("getting-started", package = "qryflow")
) and "Advanced Usage" (vignette("advanced-qryflow", package = "qryflow")
) vignettes.
qryflow
WorksWhen you run a SQL script using qryflow
, the process follows these steps:
Split the SQL script into chunks using tagged comments (e.g., -- @query: name
)
Identify the type of each chunk
Parse each chunk into a structured object using a type-specific parser
Execute each chunk using a type-specific handler
To support a new chunk type, you'll need to:
Create a parser — which converts a chunk of raw SQL into a structured qryflow_chunk
object.
Create a handler — which defines how to execute the chunk and return results.
Register your new type with qryflow
so the package knows how to process it.
Each chunk type needs to have an associated parser and an associated handler. This section outlines what arguments the custom parser and handler functions need to accept, what operations they should perform, and what results they should return.
The parser accepts the unaltered SQL statement of the chunk and should return a formal qryflow_chunk
object. Parsers will only be executed on a single chunk at a time. The sqflow_chunk
object is essentially a list in the form of:
list( type = type, name = name, sql = sql, tags = tags, results = results # Not populated until after execution )
The parser must extract the type, name, main SQL statement, and any additional tags. The package provides some helpers to do this. As an example, here is what the "exec" type parser looks like, with informative comments:
qryflow_exec_parser <- function(x, ...) { # Get a character vector containing each line lines <- read_sql_lines(x) # Identify any formal qryflow tags all_tags <- extract_all_tags(lines) # Check to see if an explicit name tag exists for the chunk name <- all_tags$name # If an explicit name tag doesn't exist, use the value from the "exec" tag # This is how aliases are implemented if(is.null(name)) { name <- all_tags[["exec"]] } # Find any additional tags beyond alias, name, or type tags <- subset_tags(all_tags, c("exec", "name", "type"), negate = TRUE) # Collapse the SQL into a single character sql_txt <- collapse_sql_lines(lines[!is_tag_line(lines)]) # Return the structured object new_qryflow_chunk(type = "exec", name = name, sql = sql_txt, tags = tags) }
Note: The package includes helper functions like read_sql_lines()
, extract_all_tags()
, subset_tags()
, is_tag_line()
, and others to make parsing easier.
Handlers accepts both a qryflow_chunk
object and a database connection object (e.g., DBI::dbConnect
). They should execute the SQL as appropriate and then return the result:
This is the handler for the "exec" type:
qryflow_exec_handler <- function(chunk, con, ...) { # Pass the SQL of the chunk to desired execution strategy result <- DBI::dbExecute(con, chunk$sql, ...) # Return the result result }
After a custom parser and handler have been created, they need to be registered.
qryflow
provides two functions to test whether parser and handler functions meet the specifications. An error will occur if:
The object is not a function
The formal arguments are not included
The formal arguments are not in the right order
validate_qryflow_parser(qryflow_exec_parser) validate_qryflow_handler(qryflow_exec_handler)
Note: These obviously do not test that the code within your function is correct nor does it test what output each function is expected to produce.
qryflow
maintains two internal environments:
.qryflow_parsers
for chunk parsers
.qryflow_handlers
for chunk handlers
When the package is loaded, default types like "query
" and "exec
" are automatically registered. You can register additional types using:
# Register separately register_qryflow_parser("custom", my_custom_parser_func, overwrite = TRUE) register_qryflow_handler("custom", my_custom_handler_func, overwrite = TRUE) # Or register both at once register_qryflow_type( "custom", parser = my_custom_parser_func, handler = my_custom_handler_func, overwrite = TRUE )
We can access what types are registered:
ls_qryflow_parsers() ls_qryflow_handlers() ls_qryflow_types()
Custom types must be re-registered each session. To make them persistent, add registration calls to your .Rprofile
(see: Managing R Startup), or create a small package with an .onLoad()
hook (see: R Packages (2e)).
query-send
Chunk TypeThis example shows how to implement a new chunk type that's similar to exec
and query
. We will create a new type, called query-send
that works like query
except calls DBI::dbSendQuery
instead of DBI::dbGetQuery
.
First, create the parser and handler:
query_send_parser <- function(x, ...) { # Convert to individual lines lines <- read_sql_lines(x) all_tags <- extract_all_tags(lines) # Check for explicit name name <- all_tags$name if (is.null(name)) { # Accomodate Aliased Approach name <- all_tags[["query-send"]] } other_tags <- subset_tags(all_tags, c("query-send", "name", "type"), negate = TRUE) sql_txt <- paste0(lines[!is_tag_line(lines)], collapse = "\n") new_qryflow_chunk(type = "query-send", name = name, sql = sql_txt, tags = other_tags) } query_send_handler <- function(chunk, con, ...){ res <- DBI::dbSendQuery(con, chunk$sql, ...) results <- DBI::dbFetch(res) DBI::dbClearResult(res) results }
Then, register them:
register_qryflow_type( "query-send", parser = query_send_parser, handler = query_send_handler, overwrite = TRUE )
Check that they registered properly:
ls_qryflow_types()
And test it out on some SQL:
# Creates an in-memory sqlite database and populates it with an mtcars table, named "mtcars" con <- example_db_connect(mtcars) # Create sql <- " -- @query-send: df_mtcars SELECT * FROM mtcars; " results <- qryflow(sql, con) head(results)
include
Chunk TypeWant to try extending qryflow
on your own?
Implement a custom "include
" chunk type that loads SQL from a separate file and injects it as a chunk.
For example, take the following statement:
-- @include: mysql.sql -- @query: my_df SELECT * FROM table
Assuming mysql.sql
contains:
-- @query: mysql SELECT * INTO table FROM another_table
How would you create the parser and handler so that functionally, what gets returned is:
-- @query: mysql SELECT * INTO table FROM another_table -- @query: my_df SELECT * FROM table
A word of caution: this may be trickier than you think given how chunks are currently implemented.
DBI::dbDisconnect(con)
Any scripts or data that you put into this service are public.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.