
Questions tagged [temporal-difference]

1 vote
0 answers
38 views

Context: I have a data set based around 16 different locations. Each location has a contaminant value measured once per year, from 2012 to 2023. The data looks something like this: Location Type Year ...
User493461's user avatar
1 vote
0 answers
94 views

I am conducting my master's thesis on the temporal patterns of spore production in two specific species and the environmental drivers associated with these patterns. I began with a visual analysis of ...
Lara Wüthrich's user avatar
1 vote
1 answer
187 views

I am working with time-series data, where each day is represented by a CSV file containing a 24×25 grid, with each entry acting as a pixel. I have generated ACF and PACF to understand the temporal ...
the_tomato's user avatar
1 vote
1 answer
128 views

I have a dataset with 80 species which were sampled in about 120 water bodies at two time periods (historical / recent). Only presence/absence of the species in each water body is considered. The data ...
Friede's user avatar
  • 11
1 vote
1 answer
68 views

I collected animal samples without replacement over three time periods from the same locality: seven years apart in the deep sea (e.g., no known seasonality). I want to know whether the mean difference ...
halfaxa's user avatar
  • 11
2 votes
1 answer
141 views

So, expected SARSA defines the update as: $$ Q(s,a) = Q(s,a) +\alpha (R+ \mathbb{E}_{a'\sim\pi(s')}[Q(s', a')] - Q(s,a)) $$ whereas SARSA samples $a'\sim\pi(s')$ and defines the update as: $$ Q(s,a) = Q(s,a) +\alpha (...
Alberto's user avatar
  • 1,561
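
The two targets in the excerpt above can be compared numerically. The following is a minimal sketch, not the asker's code: the tabular `Q`, the epsilon-greedy policy, and the `gamma` parameter are all assumptions made for illustration (`gamma=1.0` matches the undiscounted form written in the excerpt). It shows that the expected SARSA target equals the average of the SARSA target over $a'\sim\pi(s')$.

```python
def epsilon_greedy_probs(q_values, epsilon):
    """Action probabilities of an epsilon-greedy policy (assumed policy)."""
    n = len(q_values)
    greedy = max(range(n), key=lambda a: q_values[a])
    return [epsilon / n + (1.0 - epsilon if a == greedy else 0.0)
            for a in range(n)]


def sarsa_target(Q, reward, s_next, a_next, gamma=1.0):
    """SARSA target: uses the single sampled next action a_next."""
    return reward + gamma * Q[s_next][a_next]


def expected_sarsa_target(Q, reward, s_next, probs, gamma=1.0):
    """Expected SARSA target: averages Q(s', .) under the policy."""
    return reward + gamma * sum(p * q for p, q in zip(probs, Q[s_next]))


# Hypothetical 2-state, 2-action table, for illustration only.
Q = [[0.0, 0.0], [1.0, 3.0]]
probs = epsilon_greedy_probs(Q[1], epsilon=0.2)

expected = expected_sarsa_target(Q, reward=0.5, s_next=1, probs=probs)
# Averaging the SARSA target over a' ~ pi(s') gives the same number.
mean_sarsa = sum(p * sarsa_target(Q, 0.5, 1, a) for a, p in enumerate(probs))
```

Both targets have the same mean under these assumptions; the SARSA target additionally varies with the sampled `a_next`, while the expected SARSA target does not.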
2 votes
0 answers
201 views

I have been going through "Sutton & Barto Book: Reinforcement Learning: An Introduction", and in "Chapter 11: Off-policy Methods with Approximation", Example 11.1 briefly ...
Emre Y.'s user avatar
  • 21
2 votes
1 answer
97 views

I'm looking for appropriate types of analysis for a data set that contains counts of different crab species across 4 sites with 3 replicates per site (12 in total) over a time period of 1.5 years - 5 ...
Susanne Bähr's user avatar
1 vote
0 answers
58 views

This is a two-part question that first asks how to query some data I have in R, and secondly, asks what might be the appropriate statistical operations to test any perceived relationships between ...
Wangana's user avatar
  • 153
0 votes
0 answers
53 views

I'm trying to define a model for comparing temporal intervals, and can't find an appropriate distance function that incorporates the different relations two temporal intervals might have, e.g. overlap,...
Qais Abou Housien's user avatar
1 vote
0 answers
1k views

I am trying to understand how LSTD works in value function approximation. I am reading the preliminaries of this paper. I sort of understand how the LSTD method differs from TD learning. In TD ...
calveeen's user avatar
  • 1,136
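
As a rough illustration of the contrast this question raises, here is a minimal LSTD(0) sketch under stated assumptions: linear features, a batch of `(phi, reward, phi_next)` transitions, and a small ridge term `reg` added for numerical stability. None of these names come from the paper mentioned in the question. Instead of taking incremental TD steps, it accumulates $A = \sum \phi(\phi - \gamma\phi')^\top$ and $b = \sum \phi r$ and solves $Aw = b$ once.

```python
import numpy as np


def lstd(transitions, gamma, reg=1e-6):
    """Solve A w = b for the weights of a linear value function.

    transitions: iterable of (phi, reward, phi_next); phi_next should be
    all zeros when the next state is terminal.
    """
    transitions = list(transitions)
    k = len(transitions[0][0])
    A = reg * np.eye(k)  # ridge term (an assumption) keeps A invertible
    b = np.zeros(k)
    for phi, reward, phi_next in transitions:
        phi = np.asarray(phi, dtype=float)
        phi_next = np.asarray(phi_next, dtype=float)
        A += np.outer(phi, phi - gamma * phi_next)
        b += phi * reward
    return np.linalg.solve(A, b)


# Hypothetical chain s0 -> s1 -> terminal with one-hot features.
data = [
    ([1.0, 0.0], 0.0, [0.0, 1.0]),  # s0 -> s1, reward 0
    ([0.0, 1.0], 1.0, [0.0, 0.0]),  # s1 -> terminal, reward 1
]
w = lstd(data, gamma=0.9)  # approximately [0.9, 1.0]
```

With one-hot features the weights are just the state values, so the result can be checked by hand: $V(s_1) = 1$ and $V(s_0) = 0.9 \cdot 1$.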
0 votes
1 answer
2k views

So I've been trying to work out exactly what temporal leakage is for a while now and I'm getting nowhere. I'm not necessarily looking to code or anything, I'm more so interested in what it actually is ...
Fluffyrox4's user avatar
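
One common reading of temporal leakage is that the model is trained on observations recorded later than some of the observations it is evaluated on, so information from the "future" leaks into training. The sketch below uses hypothetical rows with a time index `t` and is only an illustration of that reading, not a definition taken from the question. It contrasts an interleaved split, which leaks, with a chronological split, which does not.

```python
def chronological_split(rows, cutoff):
    """Train on everything strictly before cutoff, test on the rest."""
    train = [r for r in rows if r["t"] < cutoff]
    test = [r for r in rows if r["t"] >= cutoff]
    return train, test


def has_temporal_leak(train, test):
    """True if any training row is at or after the earliest test row."""
    return max(r["t"] for r in train) >= min(r["t"] for r in test)


# Hypothetical time-indexed data: ten observations at t = 0..9.
rows = [{"t": t, "y": 2 * t} for t in range(10)]

# Interleaved split: every other row goes to training, so training
# contains rows that are later than some test rows.
leaky_train, leaky_test = rows[::2], rows[1::2]

# Chronological split: training stops before the test period begins.
clean_train, clean_test = chronological_split(rows, cutoff=7)

leaky = has_temporal_leak(leaky_train, leaky_test)
clean = has_temporal_leak(clean_train, clean_test)
```

Here `leaky` is `True` and `clean` is `False`: only the chronological split keeps every training timestamp before every test timestamp.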
1 vote
1 answer
2k views

I want to compare changes in a variable that occur over time spans of different duration. Here is a hypothetical example: Precipitation in Region A decreased by 300 mm (from 1000 mm to 700 mm) over a ...
M_S's user avatar
  • 111
2 votes
1 answer
749 views

I'm a robotics engineer who's relatively new to reinforcement learning, and I want to try simple reinforcement learning on a robot to optimize its velocity. I am, however, having trouble with ...
Mr_Melon's user avatar
6 votes
1 answer
2k views

While learning reinforcement learning from David Silver's online lectures, I came across the statement "the objective of TD learning, $r_t + \gamma V(s_{t+1})$, is a biased target for learning the value function." I know the ...
DiveIntoML's user avatar
  • 2,103
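
The bias in the quoted statement can be seen in a small worked example. This is a sketch on a hypothetical two-step chain (the rewards, probabilities, and the inaccurate estimate `v_hat_s1` are all assumptions): the TD target bootstraps from the current estimate of $V(s_{t+1})$, so its expectation is off whenever that estimate is off, while the Monte Carlo return averages to the true value.

```python
GAMMA = 1.0

# Hypothetical chain: s0 -> s1 with reward 0, then s1 -> terminal with
# reward 2 or 0, each with probability 0.5.
OUTCOMES = [(0.5, 2.0), (0.5, 0.0)]  # (probability, final reward)

# True values, computed exactly by enumerating outcomes.
true_v_s1 = sum(p * r for p, r in OUTCOMES)
true_v_s0 = 0.0 + GAMMA * true_v_s1

# An (assumed) inaccurate current estimate of V(s1).
v_hat_s1 = 0.25

# Expected Monte Carlo return from s0: averages the actual returns.
expected_mc = sum(p * (0.0 + GAMMA * r) for p, r in OUTCOMES)

# TD target from s0: reward plus the bootstrapped estimate of V(s1).
td_target = 0.0 + GAMMA * v_hat_s1

mc_bias = expected_mc - true_v_s0  # zero: unbiased
td_bias = td_target - true_v_s0    # nonzero while v_hat_s1 is wrong
```

In this setup the Monte Carlo return is unbiased but takes the values 2 or 0, whereas the TD target is always 0.25: lower variance, at the cost of a bias of -0.75 that shrinks only as `v_hat_s1` approaches the true value.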
