-
-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Consider nested covariance structures? #174
Comments
Very interesting point, I wonder how sas handles this? This reminds me of the discussion we had a while ago on the relation between MMRM and Gaussian Processes. The Kronecker idea is (I think) equivalent to something that is quite common in the GP literature of composing elementary covariance functions to model more complex structures, see e.g. https://www.cs.toronto.edu/~duvenaud/cookbook/. A population-marginal MMRM is (haven't x-ed it out yet) equivalent to a GP with mean structure (usually the mean structure is not modelled in GP literature) and a covariance function for the Gaussian residuals. The "usual" covariance structures are all stationary covariance functions for a discrete metric on the space of visits. I.e. the covariance only depends on the visits through the number of visits between observations, not the actual timings. The "spatial" covariance structures relax this by depending on the actual time difference. From a covariance function perspective the problem of multiple observations per individual/visit arises because the difference (in either type of structure, spatial or not) being zero, hence the observations would be perfectly correlated and cannot differ (singular vcov matrix). Numerically the fix is to add a diagnoal to the vcov matrix. In covariance function terms that would be the composition of the original kernel with a "nugget" kernel. The Kronecker trick is (I think) nothing but the product of two covariance functions (see e.g. http://gaussianprocess.org/gpml/chapters/RW4.pdf or the kernel cookbook link above again). Here it seems more natural to add kernels though. If mmrm were to support such compositions (product and weighted sum), this would also allow to perform implicit covariance structure selection if the weights of a sum of covariance structures are also estimated. Probably hard to identify well in practice though. |
Let us assume we have two measurements, A and B, per visit per patient. Then Daniel suggested to write each individual-block in the overall covariance matrix as (assuming there are only two visits present in the data): So a very useful starting point would be to allow the user to specify, for example, an ante-dependence structure for each block in the image above. |
Yes, I t agree with that. My point is more about implementation. If we worked with covariance functions under the hood, there would not need to be a special case for multiple observations. The covariance could depend on the visit difference (see refs above). Then, in your notation, the "difference" between A_i and B_i is zero, hence the respective variance is returned. Now, to support heterogeneous variances, the covariance function would have to be non-stationary. Hm, what would be the signature of a general covariance function, probably cov(visit_1, visit_2, theta) where theta are the covariance function specific parameters. The advantage is that this formulation is completely independent of the sequence of observations. |
@kkmann Not sure if I understand what you write :-D But I guess you mean that we could formulate also the non-spatial covariance structures in a similar way as we do for the spatial covariance structures? And this could maybe be helpful for more complex covariance structures such as the nested one here? Can we maybe still also think about a lower bar option? E.g. if we still keep the non-spatial vs. spatial differentiation, how would we formulate a covariance structure for the nested case? e.g. the simple 2 visits, 2 subjects case that @garcia-mc has above. |
x) thanks for translating. I think we all agree that it is statistically fine to have multiple obs (possible in SAS?) we just need to define the variance. Implementing this via covariance functions just removes the need to manually align the blocks (you do not even have to order visit times). What exactly do you mean with "covariance structure for the nested case" then? To me a covariance structure is a function mapping from a set of parameters and a dimension (number of visits) to a covariance matrix like here: Line 10 in 0c5366a
A fancy way would be to define an abstract class "covariance function" and overload the "()" to be able to treat it like a function as well. One could then define sums and multiples of covariance functions. A covariance matrix is then generated on the fly from the respective covariance function (or a composition thereof). That would be pretty flexible and remove the need to sort anything or worry about duplicates. Otherwise this could quickly become an "index battle" x) |
@kkmann ok and if we had these sums and multiples of covariance functions, how would we use this for the nested use case above? can we write this down in math formula? Just for my brain to be able to parse it. |
I wonder, does it make sense to go via a |
Yeah design doc for this would be awesome! |
I hope this is not too time critical x) I'd need some muse to fiddle with this. |
@kkmann no worries, not time critical at all |
Got another question on this today (as part of our BBS workshop). So seems there is some interest in this. |
I would just like to express my interest in having the functionality of nested covariance structures available in the mmrm package. It appears that SAS implements this through the use of a Kronecker product covariance matrix:
|
Possibly related as precedent: chg ~ group * time + baseline
sigma ~ group * time + baseline where None of this is affects the correlation matrix, however. |
So recently we had a user who wanted to model multiple assessments per visit per subject. We currently cannot model this adequately with the existing covariance structure framework, because:
time = interaction(assessment, visit)
and usetime
What could be better "nested" covariance structures here? Basically we still want to keep independence between subjects (subject blocks in overall covariance matrix) because that allows us to scale the computations well with number of observations. On the other hand we would like to include additional structure within subjects.
I am wondering if the Kronecker idea (@clarkliming) could be useful here or whether @kkmann has other ideas.
The text was updated successfully, but these errors were encountered: