You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm not too sure if this issue sits within dbplyr or RSQLite, but the slice_sample() function does not appear to be taking account of a seed set via set.seed() when using an SQLite database.
If I set a seed, then run slice_sample(), then later set the same seed, and run slice_sample() again, the sampled table is different.
For comparison, when undertaking these sample operations using a Microsoft SQL Server database, the expected behaviour is observed (i.e., the two sampled tables are identical).
If this is out of the control of R packages (for example, if the issue is with SQLite itself), would it be possible to return a warning to inform the user?
set.seed() can't affect the database (how could it as it doesn't even have any argument related to the database). This is true for SQLite and SQL Server and any other database. If slice_sample() consistently samples the same rows for SQL Server this would be a bug.
A warning doesn't make much sense here and would mostly annoy users.
I'm not too sure if this issue sits within
dbplyr
orRSQLite
, but theslice_sample()
function does not appear to be taking account of a seed set viaset.seed()
when using an SQLite database.If I set a seed, then run
slice_sample()
, then later set the same seed, and runslice_sample()
again, the sampled table is different.For comparison, when undertaking these sample operations using a Microsoft SQL Server database, the expected behaviour is observed (i.e., the two sampled tables are identical).
The text was updated successfully, but these errors were encountered: