CI: shift core-library tests to a separate "job" by mabruzzo · Pull Request #283 · grackle-project/grackle

mabruzzo · 2025-03-26T04:19:05Z

This PR shifts the core-library tests to a separate "job."

It has come to my attention in recent PRs (e.g. #273, #279), that our current choice to run the core-library test-suite after the Pygrackle test-suite is somewhat undesirable. Specifically, the core-library test-suite is more likely to include unit-tests (while there may not be many unit-tests yet, many forthcoming PRs have proposed adding them).

This is important for 2 reasons:

Unit-tests are inherently faster than answer-tests. We should run these as soon as possible so we can fail quickly.
Unit-tests are also more specialized than answer-tests. If a unit-test fails, it is likely that an answer-test will also fail. The way things have been configured (before this PR) will cause the answer-test to fail and CI to abort before the unit-tests can be reun.

As I write this up, I realize that I probably just could have re-ordered the tests. I'm totally willing to do that. But, I think in some ways this may be better (if we aren't too worried about using up resources)

This PR shifts the core-library tests to a separate "job." It has come to my attention in recent PRs (e.g. grackle-project#273, grackle-project#279), that our current choice to run the core-library test-suite after the Pygrackle test-suite is somewhat undesirable. Specifically, the core-library test-suite is more likely to include unit-tests (while there may not be many unit-tests yet, many forthcoming PRs have proposed adding them). This is important for 2 reasons: 1. Unit-tests are inherently faster than answer-tests. We should run these as soon as possible so we can fail quickly. 2. Unit-tests are also more specialized than answer-tests. If a unit-test fails, it is likely that an answer-test will also fail. The way things have been configured (before this PR) will cause the answer-test to fail and CI to abort before the unit-tests can be reun. As I write this up, I realize that I probably just could have re-ordered the tests. I'm totally willing to do that. But, I think in some ways this may be better (if we aren't too worried about using up resources)

…ion flags seem to make a difference.

… script

I realized that the running the code on MacOS machine with CMAKE_BUILD_TYPE=Release and CMAKE_BUILD_TYPE=Debug produces slightly different answers. The flags used in these cases are: - Release: -O3 - Debug: -g This is fairly disturbing since it should not make a difference (-O3 does not enable any floating point optimizations). While I haven't tested it, I'm **extremely** confident that this isn't a CMake issue. But I'm not really sure what we can do... Let's keep track of this

brittonsmith

This looks good. I had one minor comment. Feel free to deal with that and merge. Only slightly related to this, one thing I don't like is that we run tests with the classic build system in generate mode prior to creating the gold standard. It probably had marginal benefit, but I think it would make sense for gold standard generation to happen first and for as much of the testing as is possible to run in compare mode.

Co-authored-by: Britton Smith <brittonsmith@gmail.com>

mabruzzo · 2025-04-02T13:14:45Z

Only slightly related to this, one thing I don't like is that we run tests with the classic build system in generate mode prior to creating the gold standard. It probably had marginal benefit, but I think it would make sense for gold standard generation to happen first and for as much of the testing as is possible to run in compare mode.

I 100% agree. I've been tempted to change this a couple of times. But, I was previously under the false impression that you had picked the current order (I now suspect that the current order originated from a typo OR the fact that you and I were making large changes to the answer testing framework around the same time as each other)

brittonsmith · 2025-04-02T13:50:18Z

I also suspect this was simply a typo.

mabruzzo · 2025-04-03T17:17:57Z

As we discussed offline, I adjusted the continuous integration so that now run tests with the classic build system in compare-mode AFTER we record the test-answers.

While I doing this I made 3 really minor tweaks:

I fixed a typo in the name of one of the new CI "steps"
I made sure the CI will explicitly fail if we make a particular mistake when modify config.yml (we were already printing an error message for this case)

More importantly, I caught a bug where we weren't properly running the gold-standard in compare mode (this was due to a typo on my part in a previous PR).

While trying to fix that bug, I found and fixed a separate minor bug that was messing with string comparisons when we use circle-ci's parameters (as luck would have it, I actually think everything was previously working as intended). The fix was simple: add some quotes

mabruzzo added the ci Related to Continuous Integration label Mar 26, 2025

mabruzzo force-pushed the ci-corelibtests-job branch from 1d20c17 to 4693736 Compare March 26, 2025 04:26

update relative tolerance (its a little disturbing that the optimizat…

c731804

…ion flags seem to make a difference.

brittonsmith added this to the 3.4 milestone Mar 26, 2025

mabruzzo added 2 commits March 30, 2025 18:33

improved the clarity of testing reporting by the code_example_checker…

9ac7ef0

… script

brittonsmith approved these changes Apr 1, 2025

View reviewed changes

Comment thread tests/scripts/code_example_checker.py Outdated

Update tests/scripts/code_example_checker.py

d705328

Co-authored-by: Britton Smith <brittonsmith@gmail.com>

mabruzzo added 6 commits April 3, 2025 11:31

Merge branch 'main' into ci-corelibtests-job

b5f1769

adopt a slightly more meaningful testing order for pygrackle suite.

7bebd84

fix a few minor bugs in .circleci/config.yml

be85e16

cleanup description of some circleci tasks

077438f

debugging commit

49f57d2

I think I fixed it

6770277

mabruzzo merged commit 0405d21 into grackle-project:main Apr 3, 2025

mabruzzo deleted the ci-corelibtests-job branch April 3, 2025 17:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

CI: shift core-library tests to a separate "job"#283

CI: shift core-library tests to a separate "job"#283
mabruzzo merged 11 commits into
grackle-project:mainfrom
mabruzzo:ci-corelibtests-job

mabruzzo commented Mar 26, 2025

Uh oh!

brittonsmith left a comment

Uh oh!

Uh oh!

mabruzzo commented Apr 2, 2025

Uh oh!

brittonsmith commented Apr 2, 2025

Uh oh!

mabruzzo commented Apr 3, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

mabruzzo commented Mar 26, 2025

Uh oh!

brittonsmith left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mabruzzo commented Apr 2, 2025

Uh oh!

brittonsmith commented Apr 2, 2025

Uh oh!

mabruzzo commented Apr 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

mabruzzo commented Apr 3, 2025 •

edited

Loading