Index supplementary files for AnVIL (#5000) #5059

Merged

nadove-ucsc
Contributor

@nadove-ucsc nadove-ucsc commented Mar 17, 2023

Connected issues: #5000

Checklist

Author

  • PR is a draft
  • Target branch is develop
  • Name of PR branch matches issues/<GitHub handle of author>/<issue#>-<slug>
  • PR title references all connected issues
  • PR title matches[1] that of a connected issue or comment in PR explains why they're different
  • For each connected issue, there is at least one commit whose title references that issue
  • PR is connected to all connected issues via ZenHub
  • PR description links to connected issues
  • Added partial label to PR or this PR completely resolves all connected issues

[1] When the issue title describes a problem, the corresponding PR title is Fix: followed by the issue title.
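
The branch naming rule in the checklist above (issues/<GitHub handle of author>/<issue#>-<slug>) can be checked mechanically. The following is only a minimal sketch: the regular expression, the allowed character sets for the handle and slug, and the function name `parse_branch` are assumptions made for illustration and are not taken from the Azul code base.

```python
import re
from typing import Optional

# Hypothetical pattern: handle = letters, digits, hyphens; issue = digits;
# slug = lowercase alphanumeric words joined by hyphens (an assumption).
BRANCH_RE = re.compile(
    r'issues/(?P<handle>[A-Za-z0-9-]+)/(?P<issue>\d+)-(?P<slug>[a-z0-9]+(?:-[a-z0-9]+)*)'
)


def parse_branch(name: str) -> Optional[dict]:
    """Return the parts of a conforming branch name, or None if it does not match."""
    match = BRANCH_RE.fullmatch(name)
    return None if match is None else match.groupdict()


# The branch used by this PR conforms to the convention
parts = parse_branch('issues/noah-aviel-dove/5000-index-supp-files-anvil')
```

Here `parts['issue']` would be `'5000'`, which could then be compared against the issue numbers referenced in the PR title.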

Author (reindex, API changes)

  • Added r tag to commit title or this PR does not require reindexing
  • Added reindex label to PR or this PR does not require reindexing
  • Added a (compatible changes) or A (incompatible ones) tag to commit title or this PR does not modify the Azul service API
  • Added API label to connected issues or this PR does not modify the Azul service API

Author (chains)

  • This PR is blocked by previous PR in the chain or this PR is not chained to another PR
  • Added base label to the blocking PR or this PR is not chained to another PR
  • Added chained label to this PR or this PR is not chained to another PR

Author (upgrading)

  • Documented upgrading of deployments in UPGRADING.rst or this PR does not require upgrading
  • Added u tag to commit title or this PR does not require upgrading
  • Added upgrade label to PR or this PR does not require upgrading

Author (operator tasks)

  • Added checklist items for additional operator tasks or this PR does not require additional tasks

Author (hotfixes)

  • Added F tag to main commit title or this PR does not include permanent fix for a temporary hotfix
  • Reverted the temporary hotfixes for any connected issues or the prod branch has no temporary hotfixes for any connected issues

Author (before every review)

  • Rebased PR branch on develop, squashed old fixups
  • Ran make requirements_update or this PR does not touch requirements*.txt, common.mk, Makefile and Dockerfile
  • Added R tag to commit title or this PR does not touch requirements*.txt
  • Added reqs label to PR or this PR does not touch requirements*.txt
  • make integration_test passes in personal deployment or this PR does not touch functionality that could break the IT

Peer reviewer (after requesting changes)

Uncheck the Author (before every review) checklists.

Peer reviewer (after approval)

  • PR is not a draft
  • Ticket is in Review requested column
  • Requested review from primary reviewer
  • Assigned PR to primary reviewer

Primary reviewer (after requesting changes)

Uncheck the before every review checklists. Update the N reviews label.

Primary reviewer (after approval)

  • Actually approved the PR
  • Labeled connected issues as demo or no demo
  • Commented on connected issues about demo expectations or all connected issues are labeled no demo
  • Decided if PR can be labeled no sandbox
  • PR title is appropriate as title of merge commit
  • N reviews label is accurate
  • Moved ticket to Approved column
  • Assigned PR to current operator

Operator (before pushing the merge commit)

  • Checked reindex label and r commit title tag
  • Checked that demo expectations are clear or all connected issues are labeled no demo
  • Squashed PR branch and rebased onto develop
  • Sanity-checked history
  • Pushed PR branch to GitHub
  • Pushed PR branch to GitLab dev and added sandbox label or PR is labeled no sandbox
  • Pushed PR branch to GitLab anvildev or PR is labeled no sandbox
  • Build passes in sandbox deployment or PR is labeled no sandbox
  • Build passes in anvilbox deployment or PR is labeled no sandbox
  • Reviewed build logs for anomalies in sandbox deployment or PR is labeled no sandbox
  • Reviewed build logs for anomalies in anvilbox deployment or PR is labeled no sandbox
  • Deleted unreferenced indices in sandbox or this PR does not remove catalogs or otherwise cause unreferenced indices
  • Deleted unreferenced indices in anvilbox or this PR does not remove catalogs or otherwise cause unreferenced indices
  • Started reindex in sandbox or this PR does not require reindexing sandbox
  • Started reindex in anvilbox or this PR does not require reindexing anvilbox
  • Checked for failures in sandbox or this PR does not require reindexing sandbox
  • Checked for failures in anvilbox or this PR does not require reindexing anvilbox
  • Added PR reference to merge commit title
  • Collected commit title tags in merge commit title
  • Moved connected issues to Merged column in ZenHub
  • Pushed merge commit to GitHub

Operator (after pushing the merge commit)

  • Shortened the PR chain or this PR is not labeled base
  • Pushed merge commit to GitLab dev or PR is labeled no sandbox
  • Pushed merge commit to GitLab anvildev or PR is labeled no sandbox
  • Build passes on GitLab dev[1]
  • Reviewed build logs for anomalies on GitLab dev[1]
  • Build passes on GitLab anvildev[1]
  • Reviewed build logs for anomalies on GitLab anvildev[1]
  • Deleted PR branch from GitHub
  • Deleted PR branch from GitLab dev
  • Deleted PR branch from GitLab anvildev

[1] When pushing the merge commit is skipped due to the PR being labelled no sandbox, the next build triggered by a PR whose merge commit is pushed determines this checklist item.

Operator (reindex)

  • Deleted unreferenced indices in dev or this PR does not remove catalogs or otherwise cause unreferenced indices
  • Deleted unreferenced indices in anvildev or this PR does not remove catalogs or otherwise cause unreferenced indices
  • Started reindex in dev or this PR does not require reindexing
  • Started reindex in anvildev or this PR does not require reindexing
  • Checked for and triaged indexing failures in dev or this PR does not require reindexing
  • Checked for and triaged indexing failures in anvildev or this PR does not require reindexing
  • Emptied fail queues in dev deployment or this PR does not require reindexing
  • Emptied fail queues in anvildev deployment or this PR does not require reindexing

Operator

  • Unassigned PR

Shorthand for review comments

  • L line is too long
  • W line wrapping is wrong
  • Q bad quotes
  • F other formatting problem

@github-actions github-actions bot added the orange [process] Done by the Azul team label Mar 17, 2023
@nadove-ucsc nadove-ucsc added the reindex:dev [process] PR requires reindexing dev label Mar 17, 2023
@nadove-ucsc nadove-ucsc force-pushed the issues/noah-aviel-dove/5000-index-supp-files-anvil branch 3 times, most recently from 4e0d472 to 6173c7c Compare March 17, 2023 23:48
src/azul/indexer/__init__.py Fixed
src/azul/types.py Fixed
@nadove-ucsc nadove-ucsc force-pushed the issues/noah-aviel-dove/5000-index-supp-files-anvil branch 2 times, most recently from 78b9ce3 to 3388fe9 Compare March 18, 2023 00:07
@coveralls

coveralls commented Mar 18, 2023

Coverage Status

Coverage: 84.62% (-0.2%) from 84.78% when pulling c2ee74e on issues/noah-aviel-dove/5000-index-supp-files-anvil into 48fcfd5 on develop.

@codecov

codecov bot commented Mar 18, 2023

Codecov Report

Merging #5059 (0b50e33) into develop (48fcfd5) will decrease coverage by 0.04%.
The diff coverage is 70.00%.

❗ Current head 0b50e33 differs from pull request most recent head c2ee74e. Consider uploading reports for the commit c2ee74e to get more accurate results

@@             Coverage Diff             @@
##           develop    #5059      +/-   ##
===========================================
- Coverage    84.75%   84.72%   -0.04%     
===========================================
  Files          149      149              
  Lines        18248    18228      -20     
===========================================
- Hits         15467    15443      -24     
- Misses        2781     2785       +4     

| Impacted Files | Coverage Δ |
|---|---|
| src/azul/plugins/repository/canned/__init__.py | 0.00% <0.00%> (ø) |
| src/azul/plugins/repository/tdr_anvil/__init__.py | 32.03% <28.57%> (-0.30%) ⬇️ |
| src/azul/plugins/repository/tdr_hca/__init__.py | 89.95% <92.85%> (ø) |
| src/azul/indexer/__init__.py | 100.00% <100.00%> (ø) |
| src/azul/plugins/repository/dss/__init__.py | 90.43% <100.00%> (ø) |
| src/azul/plugins/repository/tdr.py | 91.20% <100.00%> (ø) |

... and 2 files with indirect coverage changes

@nadove-ucsc nadove-ucsc force-pushed the issues/noah-aviel-dove/5000-index-supp-files-anvil branch from 150f102 to 14bacb2 Compare March 18, 2023 01:16
@nadove-ucsc nadove-ucsc force-pushed the issues/noah-aviel-dove/5000-index-supp-files-anvil branch 5 times, most recently from 47617da to 2deef77 Compare March 18, 2023 06:18
@nadove-ucsc nadove-ucsc added the chained [process] PR needs to based of develop before merging label Mar 24, 2023
@nadove-ucsc nadove-ucsc force-pushed the issues/noah-aviel-dove/5000-index-supp-files-anvil branch from 0002e32 to 99522e5 Compare March 24, 2023 00:10
@nadove-ucsc nadove-ucsc changed the base branch from develop to issues/noah-aviel-dove/5053-it-can-bundle-slow March 24, 2023 00:13
@nadove-ucsc nadove-ucsc force-pushed the issues/noah-aviel-dove/5000-index-supp-files-anvil branch 2 times, most recently from ec7ab38 to 0550d85 Compare March 24, 2023 19:45
@nadove-ucsc
Contributor Author

The red status is due to the PR being chained, which causes false positives in CodeQL. Every commit except for the last should be semantically neutral.

Contributor

@dsotirho-ucsc dsotirho-ucsc left a comment

Nicely done. Just some nitpick suggestions.

            return None
        else:
            return self._parse_drs_path(file_ref)

    def _columns(self, entity_type: EntityType) -> set[str]:
Contributor

This seems out of place at the bottom of the file. Could it be put either above or below _convert_column?

@@ -52,6 +49,12 @@ class BundleFQID(SupportsLessThan):
    uuid: BundleUUID
    version: BundleVersion

    def fqid_json(self) -> MutableJSON:
Contributor

Suggested change
-    def fqid_json(self) -> MutableJSON:
+    def as_json(self) -> MutableJSON:

Comment on lines 127 to 124
    def fqid_json(self) -> MutableJSON:
        result = super().fqid_json()
Contributor

Suggested change
-    def fqid_json(self) -> MutableJSON:
-        result = super().fqid_json()
+    def as_json(self) -> MutableJSON:
+        result = super().as_json()

Contributor Author

Decided at PL to change the shape of the bundle notification so that the source can be included in the return value of this method.
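
To illustrate the idea, here is a rough sketch of a base FQID class whose JSON method is extended by a subclass that adds the source, following the `super()` pattern from the reviewed snippet. It uses the method name `as_json` from the suggestion. The subclass name `SourcedFQID`, the plain-string `source` field, the JSON key names, and the simplified `MutableJSON` alias are assumptions for illustration only and do not reflect the actual Azul classes.

```python
from dataclasses import dataclass
from typing import Any, Dict

# Simplified stand-in for Azul's MutableJSON type (assumption)
MutableJSON = Dict[str, Any]


@dataclass(frozen=True, order=True)
class BundleFQID:
    uuid: str
    version: str

    def as_json(self) -> MutableJSON:
        # Key names are hypothetical
        return {'uuid': self.uuid, 'version': self.version}


@dataclass(frozen=True, order=True)
class SourcedFQID(BundleFQID):
    # Hypothetical: the real source reference is likely richer than a string
    source: str

    def as_json(self) -> MutableJSON:
        # Extend the parent's JSON rather than rebuilding it
        result = super().as_json()
        result['source'] = self.source
        return result


fqid = SourcedFQID(uuid='abc', version='2023-03-17', source='some-source')
notification = {'match': fqid.as_json()}
```

With this shape, a consumer of the notification would read the UUID, version, and source from a single `match` object instead of from separate top-level fields.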

        'bundle_uuid': bundle_fqid.uuid,
        'bundle_version': bundle_fqid.version
    },
    'match': bundle_fqid.fqid_json()
Contributor

Suggested change
-    'match': bundle_fqid.fqid_json()
+    'match': bundle_fqid.as_json()

@dsotirho-ucsc dsotirho-ucsc removed their assignment Mar 24, 2023
@hannes-ucsc hannes-ucsc force-pushed the issues/noah-aviel-dove/5053-it-can-bundle-slow branch from c7fcd9f to 06584fb Compare March 26, 2023 18:35
@hannes-ucsc hannes-ucsc added 3 reviews [process] Lead requested changes thrice and removed 2 reviews [process] Lead requested changes twice labels May 4, 2023
@hannes-ucsc hannes-ucsc removed their assignment May 4, 2023
@nadove-ucsc nadove-ucsc force-pushed the issues/noah-aviel-dove/5000-index-supp-files-anvil branch from 650fc5a to c739bee Compare May 4, 2023 01:04
@nadove-ucsc nadove-ucsc requested a review from hannes-ucsc May 4, 2023 01:52
@achave11-ucsc achave11-ucsc force-pushed the issues/noah-aviel-dove/5000-index-supp-files-anvil branch from c739bee to c2ee74e Compare May 5, 2023 15:58
@achave11-ucsc achave11-ucsc added the sandbox [process] Resolution is being verified in sandbox deployment label May 5, 2023
@achave11-ucsc achave11-ucsc merged commit e06862b into develop May 5, 2023
@achave11-ucsc achave11-ucsc deleted the issues/noah-aviel-dove/5000-index-supp-files-anvil branch May 5, 2023 18:15
@achave11-ucsc achave11-ucsc removed their assignment May 5, 2023
@dsotirho-ucsc dsotirho-ucsc removed the base [process] Another PR needs to be rebased before merging this one label May 8, 2023
Labels
  • 3 reviews [process] Lead requested changes thrice
  • orange [process] Done by the Azul team
  • reindex:dev [process] PR requires reindexing dev
  • sandbox [process] Resolution is being verified in sandbox deployment
5 participants