[Security] [AI assistant] setup/cleanup indices for evaluations #217078

KDKHD · 2025-04-03T17:10:32Z

Summary

Set up indices and data streams for evaluations. These will be used for ES|QL evals, and the setup can be extended to create indices for other graphs.
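
As an illustration of how the setup could be extended to other graphs, here is a minimal TypeScript sketch that keys index requests by graph type. The `GraphType` values, the `IndexRequest` shape, and the index names are assumptions made for this example, not the code in this PR.

```typescript
// Hypothetical shapes for illustration; not taken from the PR's source.
type GraphType = 'assistant' | 'other';

interface IndexRequest {
  kind: 'index' | 'data_stream';
  name: string;
}

// Each graph type declares the indices and data streams its evaluations need.
// Supporting another graph means adding an entry to this map.
const requestsByGraphType: Record<GraphType, IndexRequest[]> = {
  assistant: [
    { kind: 'index', name: 'evaluations-example-index' },
    { kind: 'data_stream', name: 'evaluations-example-stream' },
  ],
  other: [],
};

// Collects the requests for the graphs being evaluated, skipping repeated
// graph types so the same index is not requested twice.
function getIndexRequests(graphTypes: GraphType[]): IndexRequest[] {
  const seen: GraphType[] = [];
  const result: IndexRequest[] = [];
  for (const graphType of graphTypes) {
    if (seen.indexOf(graphType) !== -1) continue;
    seen.push(graphType);
    result.push(...requestsByGraphType[graphType]);
  }
  return result;
}
```

Keeping the requests in a plain map means a new graph only needs its own list of requests; the lookup code stays unchanged.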

How to test:

1. Enable the evaluations feature flag in kibana.dev.yml:

   xpack.securitySolution.enableExperimental: ['assistantModelEvaluation']

2. Launch Kibana.
3. Go to evaluations: http://localhost:5601/app/management/kibana/securityAiAssistantManagement?tab=evaluation
4. Start evaluations for the default assistant graph.
5. Go to Discover and create a data view.
6. Search for *evaluations* and check that data streams and indices exist.
7. These indices and data streams are not cleaned up when the evaluation finishes; they are cleaned up when evaluations are re-run. To test this, run the evaluation again and check that new data streams and indices are created. Cleanup cannot happen after the evaluation finishes because evaluations run asynchronously.
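
The re-run behavior in the last step can be sketched as a pure planning function: anything left over from a previous run is deleted first, then everything is created again. All names below (`planEvaluationSetup`, `Operation`, the index names) are hypothetical, and the sketch only computes the ordered operations; it does not call Elasticsearch.

```typescript
// Hypothetical types and names for illustration; not the PR's actual code.
interface IndexRequest {
  kind: 'index' | 'data_stream';
  name: string;
}

interface Operation {
  action: 'delete' | 'create';
  kind: IndexRequest['kind'];
  name: string;
}

// Plans the work for one evaluation run: remove whatever a previous run left
// behind, then create every requested index or data stream from scratch.
function planEvaluationSetup(existingNames: string[], requests: IndexRequest[]): Operation[] {
  const deletes = requests
    .filter((r) => existingNames.indexOf(r.name) !== -1)
    .map((r): Operation => ({ action: 'delete', kind: r.kind, name: r.name }));
  const creates = requests.map(
    (r): Operation => ({ action: 'create', kind: r.kind, name: r.name })
  );
  // Deletes come first so that creation never collides with stale resources.
  return deletes.concat(creates);
}

const requests: IndexRequest[] = [
  { kind: 'index', name: 'evaluations-example-index' },
  { kind: 'data_stream', name: 'evaluations-example-stream' },
];

// First run: nothing exists yet, so the plan contains only create operations.
const firstRun = planEvaluationSetup([], requests);

// Re-run: the resources from the first run still exist, so they are deleted
// and then created again.
const secondRun = planEvaluationSetup(
  ['evaluations-example-index', 'evaluations-example-stream'],
  requests
);
```

Cleaning up at the start of a run rather than at the end sidesteps the asynchronous evaluations: setup is a point where no evaluation is still reading the data.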

Checklist

Check that the PR satisfies the following conditions.

Reviewers should verify this PR satisfies this list as well.

- [X] Any text added follows EUI's writing guidelines, uses sentence case text and includes i18n support
- [X] Documentation was added for features that require explanation or tutorials
- [X] Unit or functional tests were updated or added to match the most common scenarios
- [X] If a plugin configuration key changed, check if it needs to be allowlisted in the cloud and added to the docker list
- [X] This was checked for breaking HTTP API changes, and any breaking changes have been approved by the breaking-change committee. The release_note:breaking label should be applied in these situations.
- [X] Flaky Test Runner was used on any tests changed
- [X] The PR description includes the appropriate Release Notes section, and the correct release_note:* label is applied per the guidelines

Identify risks

Does this PR introduce any risks? For example, consider risks like hard-to-test bugs, performance regressions, or potential data loss.

Describe the risk, its severity, and mitigation for each identified risk. Invite stakeholders and evaluate how to proceed before merging.

- [ ] See some risk examples
- [ ] ...

KDKHD · 2025-04-03T18:09:00Z

@elasticmachine merge upstream

KDKHD · 2025-04-07T08:35:50Z

@elasticmachine merge upstream

patrykkopycinski

LGTM

x-pack/solutions/security/plugins/elastic_assistant/server/routes/evaluate/post_evaluate.ts

...routes/evaluate/prepare_indices_for_evaluations/graph_type/assistant/index_requests/index.ts

…tes/evaluate/post_evaluate.ts Co-authored-by: Patryk Kopyciński <contact@patrykkopycinski.com>

kibanamachine · 2025-04-07T11:22:44Z

Starting backport for target branches: 8.x

https://github.com/elastic/kibana/actions/runs/14307905734

elasticmachine · 2025-04-07T11:23:24Z

💚 Build Succeeded

Buildkite Build
Commit: 5ec7bf6

Metrics [docs]

✅ unchanged

History

💔 Build #290627 failed 66cb9de
💛 Build #290047 was flaky e657366

cc @patrykkopycinski

kibanamachine · 2025-04-07T11:28:55Z

💔 All backports failed

Status	Branch	Result
❌	8.x	Backport failed because of merge conflicts

Manual backport

To create the backport manually run:

node scripts/backport --pr 217078

Questions ?

Please refer to the Backport tool documentation

…tic#217078)

(cherry picked from commit 54094bd)

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
Co-authored-by: Patryk Kopyciński <contact@patrykkopycinski.com>

Conflicts:
x-pack/solutions/security/plugins/elastic_assistant/server/routes/evaluate/post_evaluate.ts

KDKHD · 2025-04-07T11:38:35Z

💚 All backports created successfully

Status	Branch	Result
✅	8.x

Note: Successful backport PRs will be merged automatically after passing CI.

Questions ?

Please refer to the Backport tool documentation

…#217078) (#217311)

Backport

This will backport the following commits from `main` to `8.x`:

- [Security] [AI assistant] setup/cleanup indices for evaluations (#217078)

Co-authored-by: kibanamachine <42973632+kibanamachine@users.noreply.github.com>

setup/cleanup indices for evaluations

Verified

This commit was signed with the committer’s verified signature.

KDKHD Kenneth Kreindler

GPG key ID: 429CB8689E46A00B

9232eb0

KDKHD changed the title ~~setup/cleanup indices for evaluations~~ [Security] [AI assistant] setup/cleanup indices for evaluations Apr 3, 2025

KDKHD added Team:Security Generative AI backport:version v9.1.0 v8.19.0 release_note:skip labels Apr 3, 2025

KDKHD marked this pull request as ready for review April 3, 2025 17:16

KDKHD requested a review from a team as a code owner April 3, 2025 17:17

KDKHD added 2 commits April 3, 2025 18:18

setup/cleanup indices for evaluations

e224e31

setup/cleanup indices for evaluations

aa63434

Merge branch 'main' into enhancement/setup_evaluation_indices

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

e657366

KDKHD assigned patrykkopycinski Apr 4, 2025

Merge branch 'main' into enhancement/setup_evaluation_indices

14008f2

KDKHD requested a review from patrykkopycinski April 7, 2025 08:36

patrykkopycinski approved these changes Apr 7, 2025

KDKHD and others added 3 commits April 7, 2025 09:43

typo

66cb9de

KDKHD enabled auto-merge (squash) April 7, 2025 08:54

kibanamachine and others added 3 commits April 7, 2025 09:17

[CI] Auto-commit changed files from 'node scripts/eslint --no-cache -…

989d99a

…-fix'

typo

13eb0a7

typo

5ec7bf6

KDKHD merged commit 54094bd into elastic:main Apr 7, 2025
10 checks passed

KDKHD mentioned this pull request Apr 7, 2025

[8.x] [Security] [AI assistant] setup/cleanup indices for evaluations (#217078) #217311

Merged
