Search engine test #120
The notebook works as expected with a list of
When the list of genes is widened to
Then I am getting an error
The error is in the cell "Search using search engine" |
Thanks. This is probably due to the fact that for some genes no results are found. I will adjust that |
@pwalczysko fixed |
Thanks, that works. However, when a non-existent gene is searched for, the test fails (should it? One can argue that it should pass). The test fails with (see below). Note that I added print statements, which show that the list contained a non-existent gene called I think it would be good to make either
|
I will sort that out |
@pwalczysko fixed |
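For reference, one way the non-existent gene case discussed above could be handled is sketched below. This is only an illustration and not the fix that was committed: the function name `check_gene` and its arguments are hypothetical, and it assumes each service returns a list of result IDs (or `None`) per gene. A gene for which neither service returns anything is reported separately instead of failing the comparison.

```python
def check_gene(search_engine_hits, mapr_hits):
    """Compare the results of both services for a single gene.

    Hypothetical helper: both arguments are assumed to be lists of
    result IDs, or None/empty when the gene does not exist.
    """
    se = set(search_engine_hits or [])
    mp = set(mapr_hits or [])
    if not se and not mp:
        # Neither service knows the gene: not treated as a failure.
        return "no results"
    return "match" if se == mp else "mismatch"


status_missing = check_gene([], None)
status_same = check_gene([101, 102], [102, 101])
status_diff = check_gene([101, 102], [101])
```

With this approach the test would pass for an unknown gene, which is one of the two behaviours argued for above.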
Thanks @jburel, the fix works fine when a small number of genes is passed into the list. Now, with a list such as I am persistently getting
|
The problem with the
Probably too much to print out. When I comment out this line, all works fine. |
@pwalczysko I have added the ability to load all the possible values for a given key. The values are sorted |
Thanks, works fine. I have for a search between 0 and 500
Does that mean that search_engine is returning 28 more search results than mapr ? Edit:
Note that for these long searches, mapr takes about 42 minutes versus 55 seconds for search_engine. |
I will have to investigate |
Yes, I think that
means that there are more search_engine keys than mapr keys. I wonder how that could be possible? Edit: |
It seems to be more of a problem with the logic. |
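To narrow down a mismatch like the one discussed above, it can help to list which keys appear on only one side rather than comparing counts. The sketch below is an assumption about how that could be done, not code from the notebook: `compare_results` is a hypothetical name, and the inputs are assumed to be plain lists of IDs returned by each service.

```python
def compare_results(search_engine_ids, mapr_ids):
    """Return the IDs that only one of the two services found."""
    se = set(search_engine_ids)
    mp = set(mapr_ids)
    return {
        "only_in_search_engine": sorted(se - mp),
        "only_in_mapr": sorted(mp - se),
    }


# Illustrative values only, not real results from either service.
diff = compare_results([1, 2, 3, 4], [2, 3, 5])
print(diff)
```

An empty list on both sides means the two services agree. Printing only the differences also avoids dumping the full result lists for long searches.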
Tested genes between 0 and 1500. No mismatches, all looks good with the new commit (took something like 5 + 20 + 17 minutes on mapr step). |
Tested further with genes 1500-3000, in three batches of 500. The test passes in full, but mapr can take as long as 40 minutes for a 500-gene search. I suppose that this is because there are more results for those genes. This means that 0-3000 has now been tested. |
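The batching described above can be sketched as follows. This is a minimal illustration, assuming the genes are held in a Python list. The helper name `batches` and the placeholder gene names are hypothetical and not taken from the notebook.

```python
def batches(items, size=500):
    """Yield consecutive slices of `items`, each with at most `size` entries."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


# Placeholder names standing in for genes 1500-3000.
genes = [f"gene_{i}" for i in range(1500, 3000)]
chunks = list(batches(genes))
for number, chunk in enumerate(chunks, start=1):
    print(f"batch {number}: {len(chunk)} genes, {chunk[0]} to {chunk[-1]}")
```

Running each batch separately keeps a single slow mapr step from blocking the whole range and makes it easier to rerun only the batch that hit an intermittent error.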
only 47000 to go :-) |
13000 (13 thousand) done as of today ;) |
Between 18501 - 19000 I got an error on the mapr cell execution (the search engine one returned fine)
Edit: This was an intermittent error, did not repeat on second run. |
@jburel now I am consistently getting the following error on the cell
|
Add notebook comparing search engine call and mapr call
cc @pwalczysko