Replies: 3 comments 14 replies
-
@roumail Thank you for your question and welcome to the STUMPY community.
Can you please elaborate on what you mean by this?
Also, in case it matters, we are currently working on adding a new feature to look for discords in your data but it won't be ready for a while. Please see PR #505 |
Beta Was this translation helpful? Give feedback.
-
@roumail @seanlaw
BUT,
I guess the matrix profile is working good here as it captures anomaly (which is subsequence not just a single time stamp). The left red line is one of the peaks of matrix profile that is between 3000 and 4000. That is just the start of subsequence. The end of this subsequence is the second red line which, I think, captures the peak as well. (1) Btw, you may want to try both (2) Also, if you choose a smaller window size (e.g. 250), I feel your matrix profile peaks should get closer to peak of time series. (however, you still cannot expect that the peak of matrix profile matches the peak of time series) |
Beta Was this translation helpful? Give feedback.
-
@seanlaw and @NimaSarajpoor - I have a somewhat unrelated question that I'm not sure where best to ask. At the risk of I am a data scientist in a global pharmaceutical company and we often invite internal/external speakers on topics of machine learning to foster knowledge sharing about new and exciting methods. We do this via an online meeting ~ 1 hour (15 minute questions) for a group of 20-30 people. Looking forward to hear back from you! |
Beta Was this translation helpful? Give feedback.
-
Hi, new to the matrix profile so sorry if this is obvious for others.. I've plotted a matrix profile and I'm looking to capture the values of the second and third peak from my "param" plot below. To be clear, these are the timepoints b/w 3000-5000. My sequence length here is about 500, which makes sense I guess.
The tutorial for Motif and Discord discovery talks about talking the argsort[-1] but for me that would be the output I get right now which isn't interest..
I thought about combining this together with "fluss" to remove the downward sloping half of my data and well, that didn't exactly work. I also thought to just convert the output matrix profile/index output to pandas and apply heuristics to identify max value in each subsequence. However at this point I think I'm overcomplicating this
What am I missing here? Any tips?
Beta Was this translation helpful? Give feedback.
All reactions