GroundTruth eval: API changes #1353

sfc-gh-dhuang · 2024-08-16T22:35:50Z

Description

JIRA: https://snowflakecomputing.atlassian.net/browse/SNOW-1622124
Design: https://docs.google.com/document/d/1T67nNWL08jmQ7_xBpxulMU1mosecGgTKqBakhgBf79A/edit?pli=1

ORM / DAO PR: #1348

Other details good to know for developers

Please include any other details of this change useful for TruLens developers.

Type of change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to
not work as expected)
New Tests
This change includes re-generated golden test results
This change requires a documentation update

review-notebook-app · 2024-08-17T05:13:29Z

Check out this pull request on ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

src/core/trulens/core/tru.py

sfc-gh-chu · 2024-08-20T17:13:36Z

src/feedback/trulens/feedback/groundtruth.py

-                    agreement_txt, min_score_val=0, max_score_val=3
-                )
-                / 3,
+                re_0_10_rating(agreement_txt) / 10,


Why change this back to 10-point scoring?

This is because the instructions in the agreement prompt (for the feedback function GroundtruthAgreement.agreement_measure) haven't yet been updated to use the recently added configurable output score space.

I will do that, along with several other feedback prompts, in a separate PR. This change is here so that the e2e notebook can run GT eval successfully.
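To illustrate why the parser and the prompt scale have to match, here is a minimal sketch. It assumes the prompt asks the model for a 0-10 rating; `parse_0_10_rating` is a hypothetical stand-in written for this example and is not the TruLens `re_0_10_rating` implementation.

```python
import re


def parse_0_10_rating(text: str) -> int:
    """Hypothetical stand-in parser: return the first integer in [0, 10] found in text."""
    for match in re.findall(r"\b\d+\b", text):
        value = int(match)
        if 0 <= value <= 10:
            return value
    raise ValueError(f"no 0-10 rating found in {text!r}")


def normalized_agreement(text: str, max_score: int = 10) -> float:
    """Map a raw rating onto [0, 1] by dividing by the top of the prompt's scale."""
    return parse_0_10_rating(text) / max_score


# With a 0-10 prompt, dividing by 10 keeps the score inside [0, 1].
score_ok = normalized_agreement("Score: 7")

# Dividing the same 0-10 rating by 3 (a 0-3 scale) would overshoot 1.0,
# which is the mismatch the reply above describes.
score_mismatched = parse_0_10_rating("Score: 7") / 3
```

Once the prompt itself is updated to a configurable score range, the divisor would presumably follow that range instead of the fixed 10 used here.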

src/core/trulens/core/tru.py

sfc-gh-dhuang · 2024-08-20T19:48:56Z

src/core/trulens/core/database/migrations/env.py

@@ -25,7 +25,7 @@
    fileConfig(config.config_file_name)

 # Get `sqlalchemy.url` from the environment.
-if config.get_main_option("sqlalchemy.url", None) in (None, ""):
+if config.get_main_option("sqlalchemy.url", None) is None:


This change is because I've merged in https://github.com/truera/trulens/pull/1355/files, so we no longer need to consider the empty string.
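For reference, the two checks in the hunk above differ only when the configured URL is an empty string. The sketch below shows that difference with plain functions; the `SQLALCHEMY_URL` variable name and the `resolve_url` helper are assumptions made for illustration, not code taken from env.py.

```python
from typing import Callable, Mapping, Optional


def needs_env_url_old(configured: Optional[str]) -> bool:
    # Old check: both None and "" count as "not configured".
    return configured in (None, "")


def needs_env_url_new(configured: Optional[str]) -> bool:
    # New check: only None counts as "not configured".
    return configured is None


def resolve_url(
    configured: Optional[str],
    env: Mapping[str, str],
    needs_env: Callable[[Optional[str]], bool],
) -> Optional[str]:
    """Hypothetical helper: fall back to the environment when the check says so."""
    if needs_env(configured):
        return env.get("SQLALCHEMY_URL")  # assumed variable name
    return configured


env = {"SQLALCHEMY_URL": "sqlite:///example.sqlite"}

# Both checks agree when nothing is configured.
from_none_old = resolve_url(None, env, needs_env_url_old)
from_none_new = resolve_url(None, env, needs_env_url_new)

# They disagree only for the empty string, which the new check passes through.
from_empty_old = resolve_url("", env, needs_env_url_old)
from_empty_new = resolve_url("", env, needs_env_url_new)
```

The stricter `is None` check is only safe if nothing upstream can set the option to an empty string, which is what the comment above says the merged PR guarantees.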

src/core/trulens/core/tru.py

sfc-gh-pdharmana

Thank you so much

* initial commit + fix type of ground_truth argument in benchmark_experiment * beir data loader impl * WIP ORM and persist API with chunking * wip with dataframe chunking * orm for groundtruth and dataset added * schema classes added for dataset and groundtruth * more CRUD code * more crud * separete tru changes * revert unwanted changes * schema fix * rm * move beir loader to its own change * schema update * tmp id handling * add BEIR dataset loader util * batch insertion of ground truth entries in tru sdk * wip * wip notebook test * add alembic new revision * add migration versions to data.py * remove ALTER column statement as it's not supported in SQLite * add autogenerated migration revision * add alembic new revision * remove ALTER column statement as it's not supported in SQLite * batch insertion of groundtruth entries more or less work * dataset use dataset_json just like gt * update revision * update api name * remove ts * added domain * nb update * better docstring * sdk renaming * renaming 'response' to 'expected_response' in GT eval * BEIR dataset loader WIP * nb * revisions * make data_path mandatory * beir done * adjust metadata in nb * remove domain * implement chunking * todo: refactor * no download cleanup zip * comment on expected_score * let groundtruth feedback handles pd df * api skeleton * pd concat * v1 working * speed up * fix groundtruth feedback * doc updates * more doc update * remove stuff * initial commit + fix type of ground_truth argument in benchmark_experiment * beir data loader impl * WIP ORM and persist API with chunking * wip with dataframe chunking * orm for groundtruth and dataset added * schema classes added for dataset and groundtruth * more CRUD code * more crud * separete tru changes * revert unwanted changes * schema fix * rm * move beir loader to its own change * schema update * tmp id handling * add autogenerated migration revision * dataset use dataset_json just like gt * update revision * remove ts * added domain * 
revisions * remove domain * let groundtruth feedback handles pd df * pd concat * v1 working * doc updates * more doc update * remove unused param * no more negative param * update * simplify name * improve batch insertion * remove unnecessary change in env.py * simplified and incorporating pr comments - no threads just in-memory * docstring * time-based to batch-size based * remove unnecessary dataset names and rely on the actual download URL for BEIR data loader + add docstring for beir_loader * pandas / pd

sfc-gh-dhuang added 15 commits August 13, 2024 18:41

initial commit + fix type of ground_truth argument in benchmark_experiment

02bac57

beir data loader impl

d4da0ff

WIP ORM and persist API with chunking

897105a

wip with dataframe chunking

6a4fb5a

orm for groundtruth and dataset added

6b64bb3

schema classes added for dataset and groundtruth

74d8ed2

more CRUD code

ff63d98

more crud

d20c75f

separete tru changes

99c9cd3

revert unwanted changes

9fc4d70

schema fix

8d302a0

rm

ecc5ad6

Merge branch 'main' into daniel/gt-dataset-persistence

211304b

Merge branch 'main' into daniel/gt-dataset-persistence

ab7bb82

move beir loader to its own change

2670f14

sfc-gh-dhuang changed the title ~~add BEIR dataset loader util~~ GroundTruth eval: API changes Aug 16, 2024

sfc-gh-dhuang mentioned this pull request Aug 16, 2024

GroundTruth dataset implementation (ORM and DAO layers) #1348

Merged

6 tasks

sfc-gh-dhuang added 6 commits August 16, 2024 22:08

schema update

71d5f0c

tmp id handling

979c117

add BEIR dataset loader util

0585206

batch insertion of ground truth entries in tru sdk

24276ec

wip

b576448

wip notebook test

74cee3d

sfc-gh-dhuang force-pushed the daniel/gt-api branch from 90bacee to 74cee3d Compare August 17, 2024 05:13

sfc-gh-dhuang added 5 commits August 16, 2024 22:49

add alembic new revision

5ec776d

add migration versions to data.py

4268ca1

remove ALTER column statement as it's not supported in SQLite

2faa4cf

add autogenerated migration revision

02c14cf

add alembic new revision

44b7dce

sfc-gh-pdharmana reviewed Aug 19, 2024

src/core/trulens/core/tru.py (outdated)

sfc-gh-dhuang added 4 commits August 19, 2024 16:35

update

3ca7385

simplify name

703469b

improve batch insertion

eacba09

Merge branch 'daniel/gt-dataset-persistence' into daniel/gt-api

6f3490b

sfc-gh-chu reviewed Aug 20, 2024

Base automatically changed from daniel/gt-dataset-persistence to main August 20, 2024 18:18

dosubot bot added size:XXL This PR changes 1000+ lines, ignoring generated files. and removed size:XL This PR changes 500-999 lines, ignoring generated files. labels Aug 20, 2024

Merge branch 'main' into daniel/gt-api

3a1af75

dosubot bot added size:XL This PR changes 500-999 lines, ignoring generated files. and removed size:XXL This PR changes 1000+ lines, ignoring generated files. labels Aug 20, 2024

sfc-gh-dhuang added 4 commits August 20, 2024 11:23

remove unnecessary change in env.py

fe18209

simplified and incorporating pr comments - no threads just in-memory

3eeba2e

Merge branch 'main' into daniel/gt-api

87d4f6d

docstring

dc3a5e0

sfc-gh-dhuang commented Aug 20, 2024

src/core/trulens/core/tru.py

sfc-gh-dhuang requested review from sfc-gh-pdharmana and sfc-gh-chu August 20, 2024 19:16

sfc-gh-dhuang added 2 commits August 20, 2024 12:37

Merge branch 'main' into daniel/gt-api

5d9d7b4

time-based to batch-size based

7dd7858

sfc-gh-dhuang commented Aug 20, 2024

remove unnecessary dataset names and rely on the actual download URL for BEIR data loader + add docstring for beir_loader

9efda3a

sfc-gh-pdharmana reviewed Aug 20, 2024

src/core/trulens/core/tru.py (outdated)

sfc-gh-pdharmana approved these changes Aug 20, 2024

pandas / pd

d32ed01

sfc-gh-dhuang merged commit 8110c11 into main Aug 20, 2024
7 checks passed

sfc-gh-dhuang deleted the daniel/gt-api branch August 20, 2024 20:10
