DOI: Pull DOIs from URL #2967

AbeJellinek · 2023-01-23T17:02:07Z

If only the URL contains a DOI (or the URL contains a DOI and the body contains only that same DOI), we now detect a single `journalArticle` item and don't show the Select Items dialog.
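
For readers who want the detection rule spelled out, here is a minimal sketch of the idea, not the code from DOI.js. The regular expression, the `detectType()` helper, and its `bodyDOIs` parameter are assumptions made for illustration; only the name `getDOIFromURL()` appears in this PR's commit list, and its real signature may differ. The description does not say how a page with one DOI in the body and none in the URL is handled; the sketch assumes it is also treated as a single item.

```javascript
// Hypothetical sketch; the regex and helper names are illustrative assumptions.
// Loose DOI pattern: "10.", a 4-9 digit registrant code, "/", then a suffix.
const DOI_RE = /\b10\.\d{4,9}\/[^\s"'<>?#&]+/;

// Return the first DOI found in the percent-decoded URL, or null.
function getDOIFromURL(url) {
	let decoded;
	try {
		decoded = decodeURIComponent(url);
	}
	catch (e) {
		decoded = url; // malformed escapes: fall back to the raw URL
	}
	const match = decoded.match(DOI_RE);
	return match ? match[0] : null;
}

// Detect a single item when there is exactly one distinct DOI across the
// URL and the page body; otherwise report "multiple" (item selection).
function detectType(url, bodyDOIs) {
	const dois = new Set(bodyDOIs);
	const urlDOI = getDOIFromURL(url);
	if (urlDOI) dois.add(urlDOI);
	if (dois.size === 0) return false;
	return dois.size === 1 ? "journalArticle" : "multiple";
}
```

Under these assumptions, a URL carrying a DOI plus a body that repeats only that DOI collapses to one entry in the set, so no selection dialog would be needed.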

Also optimized the page-scraping code a little bit by replacing Array#includes() calls (O(n^2) overall) with Set#has() (O(n) overall).
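
The complexity difference can be illustrated with a small deduplication sketch. Both helpers below are hypothetical and written only to show the pattern; they are not taken from DOI.js.

```javascript
// Hypothetical helpers for illustration; not the translator's code.

// Quadratic overall: each includes() call scans the whole array.
function uniqueWithArray(candidates) {
	const seen = [];
	for (const doi of candidates) {
		if (!seen.includes(doi)) seen.push(doi);
	}
	return seen;
}

// Linear overall on average: Set#has() is a hash lookup.
function uniqueWithSet(candidates) {
	const seen = new Set();
	for (const doi of candidates) {
		if (!seen.has(doi)) seen.add(doi);
	}
	return Array.from(seen); // Sets iterate in insertion order
}
```

Both functions return the same list in the same order; only the cost of the membership check changes, which matters on pages with many candidate DOIs.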

Fixes #2964

DOI.js

dstillman

Looks good otherwise!

AbeJellinek · 2023-01-24T15:25:13Z

We can add a test for single-item detection (DOI in URL and no other DOIs on the page) if we can find an example, but I really could not. Pages with a DOI in the URL almost always contain another DOI somewhere.

adam3smith · 2023-01-24T17:19:58Z

> We can add a test for single-item detection (DOI in URL and no other DOIs on the page)

I can find you an example that has the same DOI on the page but no others, would that do?

AbeJellinek · 2023-01-24T17:20:29Z

Yeah, that would.

adam3smith · 2023-01-25T13:33:07Z

https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/BEJTMI

AbeJellinek · 2023-01-25T15:28:44Z

Thanks! With the other fix included in that last commit, we now have two tests for single items.

AbeJellinek requested review from adam3smith and dstillman January 23, 2023 17:02

AbeJellinek added 2 commits January 23, 2023 12:03

Fix capitalization

f13c206

Fix lint error

503c3db

dstillman reviewed Jan 24, 2023

dstillman approved these changes Jan 24, 2023

getDOIsFromURL() -> getDOIFromURL()

79687c8

Add tests, skip links to anchors on DOIs that are already in the set

e9330f0

Restore array length check

cdf9ed2

AbeJellinek merged commit 48a038d into zotero:master Mar 10, 2023

AbeJellinek deleted the doi-match-url branch March 10, 2023 16:20

zoe-translates pushed a commit to zoe-translates/translators that referenced this pull request Mar 12, 2023

DOI: Pull DOIs from URL (zotero#2967)

7373ab0

dstillman mentioned this pull request Apr 12, 2023

Always use open PDF for attachment regardless of translator zotero/zotero-connectors#431

Closed

AbeJellinek mentioned this pull request Aug 21, 2024

DOI: Check URL before page contents #1795

Closed
