Schedule an R function or formula to run after a specified period of
time. Similar to JavaScript’s setTimeout
function. Like JavaScript, R
is single-threaded so there’s no guarantee that the operation will run
exactly at the requested time, only that at least that much time will
elapse.
To avoid bugs due to reentrancy, by default, scheduled operations only
run when there is no other R code present on the execution stack; i.e.,
when R is sitting at the top-level prompt. You can force past-due
operations to run at a time of your choosing by calling
later::run_now()
.
The mechanism used by this package is inspired by Simon Urbanek’s background package and similar code in Rhttpd.
You can install the development version of later with:
pak::pak("r-lib/later")
Pass a function (in this case, delayed by 5 seconds):
later::later(function() {
print("Got here!")
}, 5)
Or a formula (in this case, run as soon as control returns to the top-level):
later::later(~print("Got here!"))
It is also possible to have a function run based on when file descriptors are ready for reading or writing, at some indeterminate time in the future.
Below, a logical vector is printed indicating which of file descriptors 21 or 22 were ready, subject to a timeout of 1s. Instead of just printing the result, the supplied function can also do something more useful such as reading from the descriptor.
later::later_fd(print, c(21L, 22L), timeout = 1)
This is useful in particular for asynchronous or streaming data transfer
over the network / internet, so that reads can be made from TCP sockets
as soon as data is available. later::later_fd()
pairs well with
functions such as curl::multi_fdset()
that return the relevant file
descriptors to be monitored .
You can also call later::later
from C++ code in your own packages, to
cause your own C-style functions to be called back. This is safe to call
from either the main R thread or a different thread; in both cases, your
callback will be invoked from the main R thread.
later::later
is accessible from later_api.h
and its prototype looks
like this:
void later(void (*func)(void*), void* data, double secs)
The first argument is a pointer to a function that takes one void*
argument and returns void. The second argument is a void*
that will be
passed to the function when it’s called back. And the third argument is
the number of seconds to wait (at a minimum) before invoking.
later::later_fd
is also accessible from later_api.h
and its
prototype looks like this:
void later_fd(void (*func)(int *, void *), void *data, int num_fds, struct pollfd *fds, double secs)
The first argument is a pointer to a function that takes two arguments:
the first being an int*
array provided by later_fd()
when called
back, and the second being a void*
. The int*
array will be the
length of num_fds
and contain the values 0
, 1
or NA_INTEGER
to
indicate the readiness of each file descriptor, or an error condition
respectively. The second argument data
is passed to the void*
argument of the function when it’s called back. The third is the total
number of file descriptors being passed, the fourth a pointer to an
array of stuct pollfds
, and the fifth the number of seconds to wait
until timing out.
To use the C++ interface, you’ll need to add later
to your
DESCRIPTION
file under both LinkingTo
and Imports
, and also make
sure that your NAMESPACE
file has an import(later)
entry.
Finally, this package also offers a higher-level C++ helper class to
make it easier to execute tasks on a background thread. It is also
available from later_api.h
and its public/protected interface looks
like this:
class BackgroundTask {
public:
BackgroundTask();
virtual ~BackgroundTask();
// Start executing the task
void begin();
protected:
// The task to be executed on the background thread.
// Neither the R runtime nor any R data structures may be
// touched from the background thread; any values that need
// to be passed into or out of the Execute method must be
// included as fields on the Task subclass object.
virtual void execute() = 0;
// A short task that runs on the main R thread after the
// background task has completed. It's safe to access the
// R runtime and R data structures from here.
virtual void complete() = 0;
}
Create your own subclass, implementing a custom constructor plus the
execute
and complete
methods.
It’s critical that the code in your execute
method not mutate any R
data structures, call any R code, or cause any R allocations, as it will
execute in a background thread where such operations are unsafe. You
can, however, perform such operations in the constructor (assuming you
perform construction only from the main R thread) and complete
method.
Pass values between the constructor and methods using fields.
#include <Rcpp.h>
#include <later_api.h>
class MyTask : public later::BackgroundTask {
public:
MyTask(Rcpp::NumericVector vec) :
inputVals(Rcpp::as<std::vector<double> >(vec)) {
}
protected:
void execute() {
double sum = 0;
for (std::vector<double>::const_iterator it = inputVals.begin();
it != inputVals.end();
it++) {
sum += *it;
}
result = sum / inputVals.size();
}
void complete() {
Rprintf("Result is %f\n", result);
}
private:
std::vector<double> inputVals;
double result;
};
To run the task, new
up your subclass and call begin()
,
e.g. (new MyTask(vec))->begin()
. There’s no need to keep track of the
pointer; the task object will delete itself when the task is complete.
// [[Rcpp::export]]
void asyncMean(Rcpp::NumericVector data) {
(new MyTask(data))->begin();
}
It’s not very useful to execute tasks on background threads if you can’t get access to the results back in R. The promises package complements later by providing a “promise” abstraction.