How are paper authors uniquely identified?
Some authors have an ORCID ID in order to be identified and distinguished in case of authors with similar name, change of name, different name format, etc. However, some sources don’t provide the ORCID ID of the author (if any exists), which causes a lot of problems when someone tries to harvest papers from websites with scholarly resources.
I thought that a combination of some author features such as name, email, and affiliation could be enough to distinguish the authors, but I don’t think this is a robust solution.
Is there any way to uniquely identify every author?
publications digital-researcher-id
add a comment |
Some authors have an ORCID ID in order to be identified and distinguished in case of authors with similar name, change of name, different name format, etc. However, some sources don’t provide the ORCID ID of the author (if any exists), which causes a lot of problems when someone tries to harvest papers from websites with scholarly resources.
I thought that a combination of some author features such as name, email, and affiliation could be enough to distinguish the authors, but I don’t think this is a robust solution.
Is there any way to uniquely identify every author?
publications digital-researcher-id
10
‘Harvesting papers’ - what, really, are you trying to do?
– Jon Custer
Dec 17 '18 at 13:17
@Jon Custer Get the metadata of papers provided by different sources .
– Agelos
Dec 17 '18 at 17:30
9
See J. Pfeffer & J. Pfeffer, "Another Article that Makes Bibliometric Analysis a Bit Harder," SIGBOVIK 2015, pp. 79-82. April 1, 2015. *Note: The specific date within that year is not a coincidence; it's a humor venue. PDF page numbers are +8.
– WBT
Dec 17 '18 at 18:27
Out of curiosity: Have you though of different spellings of the same name yet? E.g. the german family name Schröder might appear as Schröder (umlauts are available), Schroeder ("german" alternative when umlauts are not available) or Schroder ("scandinavian" alternative).
– Sabine
Dec 17 '18 at 18:33
3
Related: How are scientific papers uniquely identified?
– BlueRaja - Danny Pflughoeft
Dec 17 '18 at 18:36
add a comment |
Some authors have an ORCID ID in order to be identified and distinguished in case of authors with similar name, change of name, different name format, etc. However, some sources don’t provide the ORCID ID of the author (if any exists), which causes a lot of problems when someone tries to harvest papers from websites with scholarly resources.
I thought that a combination of some author features such as name, email, and affiliation could be enough to distinguish the authors, but I don’t think this is a robust solution.
Is there any way to uniquely identify every author?
publications digital-researcher-id
Some authors have an ORCID ID in order to be identified and distinguished in case of authors with similar name, change of name, different name format, etc. However, some sources don’t provide the ORCID ID of the author (if any exists), which causes a lot of problems when someone tries to harvest papers from websites with scholarly resources.
I thought that a combination of some author features such as name, email, and affiliation could be enough to distinguish the authors, but I don’t think this is a robust solution.
Is there any way to uniquely identify every author?
publications digital-researcher-id
publications digital-researcher-id
edited Dec 17 '18 at 20:08
TRiG
378215
378215
asked Dec 17 '18 at 9:05
AgelosAgelos
17227
17227
10
‘Harvesting papers’ - what, really, are you trying to do?
– Jon Custer
Dec 17 '18 at 13:17
@Jon Custer Get the metadata of papers provided by different sources .
– Agelos
Dec 17 '18 at 17:30
9
See J. Pfeffer & J. Pfeffer, "Another Article that Makes Bibliometric Analysis a Bit Harder," SIGBOVIK 2015, pp. 79-82. April 1, 2015. *Note: The specific date within that year is not a coincidence; it's a humor venue. PDF page numbers are +8.
– WBT
Dec 17 '18 at 18:27
Out of curiosity: Have you though of different spellings of the same name yet? E.g. the german family name Schröder might appear as Schröder (umlauts are available), Schroeder ("german" alternative when umlauts are not available) or Schroder ("scandinavian" alternative).
– Sabine
Dec 17 '18 at 18:33
3
Related: How are scientific papers uniquely identified?
– BlueRaja - Danny Pflughoeft
Dec 17 '18 at 18:36
add a comment |
10
‘Harvesting papers’ - what, really, are you trying to do?
– Jon Custer
Dec 17 '18 at 13:17
@Jon Custer Get the metadata of papers provided by different sources .
– Agelos
Dec 17 '18 at 17:30
9
See J. Pfeffer & J. Pfeffer, "Another Article that Makes Bibliometric Analysis a Bit Harder," SIGBOVIK 2015, pp. 79-82. April 1, 2015. *Note: The specific date within that year is not a coincidence; it's a humor venue. PDF page numbers are +8.
– WBT
Dec 17 '18 at 18:27
Out of curiosity: Have you though of different spellings of the same name yet? E.g. the german family name Schröder might appear as Schröder (umlauts are available), Schroeder ("german" alternative when umlauts are not available) or Schroder ("scandinavian" alternative).
– Sabine
Dec 17 '18 at 18:33
3
Related: How are scientific papers uniquely identified?
– BlueRaja - Danny Pflughoeft
Dec 17 '18 at 18:36
10
10
‘Harvesting papers’ - what, really, are you trying to do?
– Jon Custer
Dec 17 '18 at 13:17
‘Harvesting papers’ - what, really, are you trying to do?
– Jon Custer
Dec 17 '18 at 13:17
@Jon Custer Get the metadata of papers provided by different sources .
– Agelos
Dec 17 '18 at 17:30
@Jon Custer Get the metadata of papers provided by different sources .
– Agelos
Dec 17 '18 at 17:30
9
9
See J. Pfeffer & J. Pfeffer, "Another Article that Makes Bibliometric Analysis a Bit Harder," SIGBOVIK 2015, pp. 79-82. April 1, 2015. *Note: The specific date within that year is not a coincidence; it's a humor venue. PDF page numbers are +8.
– WBT
Dec 17 '18 at 18:27
See J. Pfeffer & J. Pfeffer, "Another Article that Makes Bibliometric Analysis a Bit Harder," SIGBOVIK 2015, pp. 79-82. April 1, 2015. *Note: The specific date within that year is not a coincidence; it's a humor venue. PDF page numbers are +8.
– WBT
Dec 17 '18 at 18:27
Out of curiosity: Have you though of different spellings of the same name yet? E.g. the german family name Schröder might appear as Schröder (umlauts are available), Schroeder ("german" alternative when umlauts are not available) or Schroder ("scandinavian" alternative).
– Sabine
Dec 17 '18 at 18:33
Out of curiosity: Have you though of different spellings of the same name yet? E.g. the german family name Schröder might appear as Schröder (umlauts are available), Schroeder ("german" alternative when umlauts are not available) or Schroder ("scandinavian" alternative).
– Sabine
Dec 17 '18 at 18:33
3
3
Related: How are scientific papers uniquely identified?
– BlueRaja - Danny Pflughoeft
Dec 17 '18 at 18:36
Related: How are scientific papers uniquely identified?
– BlueRaja - Danny Pflughoeft
Dec 17 '18 at 18:36
add a comment |
2 Answers
2
active
oldest
votes
Aside from ORCID (which by far not every paper and person has), there really is no sure-fire way to uniquely identify an author. Using the name becomes problematic with common names (not unusual anywhere in the world, but a particularly common issue in Asia) or name changes (for instance in case of marriage). Combining with affiliation and e-mail address will also only get you so far as most academics tend to change universities at least once or twice in their career, and both affiliation and e-mail address tend to change in these cases.
For bibliographic research, the most promising approach is probably to combine all of the above with field information (e.g., a Markus Huber publishing in medicine is not particularly likely to be the same as a Markus Huber publishing in philosophy), and train some sort of heuristic classifier. Clearly, false positives/negatives will happen, but if your goal is to holistically assess a larger field of research a few false categorizations are unlikely to impact the overall picture too much.
If your goal is to assess an individual researcher, really the most accurate information is usually to trust what information the researchers themselves maintain (e.g., a CV or publicly available publication list).
add a comment |
This is exactly what ORCID tries to achieve:
ORCID is a nonprofit organization helping create a world in which all who participate in research, scholarship and innovation are uniquely identified and connected to their contributions and affiliations, across disciplines, borders, and time. (from their website)
However, not everybody is aware of this initiative or cares enough to set up an ORCID for themselves. Some journals request ORCIDs upon submission, e.g. for Nature Methods each Corresponding authors needs to have an ORCID.
The problem with using other information to identify researcher, is that this information can change as opposed to a uniquely assigned number.
1
I am not sure it's required. It might be encouraged. For example: nature.com/articles/s41592-018-0187-8
– burger
Dec 17 '18 at 23:46
From the link in my answer: 'As part of our efforts to improve transparency in authorship, we request that all corresponding authors of published papers provide their Open Researcher and Contributor Identifier (ORCID) ID, before resubmitting the final version of the manuscript'. I don't know if this applies to all journal of the nature publishing group. And I also can't say how strictly this rule is enforced. But thanks, I changed require to request!
– L_W
Dec 18 '18 at 6:34
add a comment |
Your Answer
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "415"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
noCode: true, onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2facademia.stackexchange.com%2fquestions%2f121740%2fhow-are-paper-authors-uniquely-identified%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
2 Answers
2
active
oldest
votes
2 Answers
2
active
oldest
votes
active
oldest
votes
active
oldest
votes
Aside from ORCID (which by far not every paper and person has), there really is no sure-fire way to uniquely identify an author. Using the name becomes problematic with common names (not unusual anywhere in the world, but a particularly common issue in Asia) or name changes (for instance in case of marriage). Combining with affiliation and e-mail address will also only get you so far as most academics tend to change universities at least once or twice in their career, and both affiliation and e-mail address tend to change in these cases.
For bibliographic research, the most promising approach is probably to combine all of the above with field information (e.g., a Markus Huber publishing in medicine is not particularly likely to be the same as a Markus Huber publishing in philosophy), and train some sort of heuristic classifier. Clearly, false positives/negatives will happen, but if your goal is to holistically assess a larger field of research a few false categorizations are unlikely to impact the overall picture too much.
If your goal is to assess an individual researcher, really the most accurate information is usually to trust what information the researchers themselves maintain (e.g., a CV or publicly available publication list).
add a comment |
Aside from ORCID (which by far not every paper and person has), there really is no sure-fire way to uniquely identify an author. Using the name becomes problematic with common names (not unusual anywhere in the world, but a particularly common issue in Asia) or name changes (for instance in case of marriage). Combining with affiliation and e-mail address will also only get you so far as most academics tend to change universities at least once or twice in their career, and both affiliation and e-mail address tend to change in these cases.
For bibliographic research, the most promising approach is probably to combine all of the above with field information (e.g., a Markus Huber publishing in medicine is not particularly likely to be the same as a Markus Huber publishing in philosophy), and train some sort of heuristic classifier. Clearly, false positives/negatives will happen, but if your goal is to holistically assess a larger field of research a few false categorizations are unlikely to impact the overall picture too much.
If your goal is to assess an individual researcher, really the most accurate information is usually to trust what information the researchers themselves maintain (e.g., a CV or publicly available publication list).
add a comment |
Aside from ORCID (which by far not every paper and person has), there really is no sure-fire way to uniquely identify an author. Using the name becomes problematic with common names (not unusual anywhere in the world, but a particularly common issue in Asia) or name changes (for instance in case of marriage). Combining with affiliation and e-mail address will also only get you so far as most academics tend to change universities at least once or twice in their career, and both affiliation and e-mail address tend to change in these cases.
For bibliographic research, the most promising approach is probably to combine all of the above with field information (e.g., a Markus Huber publishing in medicine is not particularly likely to be the same as a Markus Huber publishing in philosophy), and train some sort of heuristic classifier. Clearly, false positives/negatives will happen, but if your goal is to holistically assess a larger field of research a few false categorizations are unlikely to impact the overall picture too much.
If your goal is to assess an individual researcher, really the most accurate information is usually to trust what information the researchers themselves maintain (e.g., a CV or publicly available publication list).
Aside from ORCID (which by far not every paper and person has), there really is no sure-fire way to uniquely identify an author. Using the name becomes problematic with common names (not unusual anywhere in the world, but a particularly common issue in Asia) or name changes (for instance in case of marriage). Combining with affiliation and e-mail address will also only get you so far as most academics tend to change universities at least once or twice in their career, and both affiliation and e-mail address tend to change in these cases.
For bibliographic research, the most promising approach is probably to combine all of the above with field information (e.g., a Markus Huber publishing in medicine is not particularly likely to be the same as a Markus Huber publishing in philosophy), and train some sort of heuristic classifier. Clearly, false positives/negatives will happen, but if your goal is to holistically assess a larger field of research a few false categorizations are unlikely to impact the overall picture too much.
If your goal is to assess an individual researcher, really the most accurate information is usually to trust what information the researchers themselves maintain (e.g., a CV or publicly available publication list).
answered Dec 17 '18 at 9:18
xLeitixxLeitix
99.6k35239381
99.6k35239381
add a comment |
add a comment |
This is exactly what ORCID tries to achieve:
ORCID is a nonprofit organization helping create a world in which all who participate in research, scholarship and innovation are uniquely identified and connected to their contributions and affiliations, across disciplines, borders, and time. (from their website)
However, not everybody is aware of this initiative or cares enough to set up an ORCID for themselves. Some journals request ORCIDs upon submission, e.g. for Nature Methods each Corresponding authors needs to have an ORCID.
The problem with using other information to identify researcher, is that this information can change as opposed to a uniquely assigned number.
1
I am not sure it's required. It might be encouraged. For example: nature.com/articles/s41592-018-0187-8
– burger
Dec 17 '18 at 23:46
From the link in my answer: 'As part of our efforts to improve transparency in authorship, we request that all corresponding authors of published papers provide their Open Researcher and Contributor Identifier (ORCID) ID, before resubmitting the final version of the manuscript'. I don't know if this applies to all journal of the nature publishing group. And I also can't say how strictly this rule is enforced. But thanks, I changed require to request!
– L_W
Dec 18 '18 at 6:34
add a comment |
This is exactly what ORCID tries to achieve:
ORCID is a nonprofit organization helping create a world in which all who participate in research, scholarship and innovation are uniquely identified and connected to their contributions and affiliations, across disciplines, borders, and time. (from their website)
However, not everybody is aware of this initiative or cares enough to set up an ORCID for themselves. Some journals request ORCIDs upon submission, e.g. for Nature Methods each Corresponding authors needs to have an ORCID.
The problem with using other information to identify researcher, is that this information can change as opposed to a uniquely assigned number.
1
I am not sure it's required. It might be encouraged. For example: nature.com/articles/s41592-018-0187-8
– burger
Dec 17 '18 at 23:46
From the link in my answer: 'As part of our efforts to improve transparency in authorship, we request that all corresponding authors of published papers provide their Open Researcher and Contributor Identifier (ORCID) ID, before resubmitting the final version of the manuscript'. I don't know if this applies to all journal of the nature publishing group. And I also can't say how strictly this rule is enforced. But thanks, I changed require to request!
– L_W
Dec 18 '18 at 6:34
add a comment |
This is exactly what ORCID tries to achieve:
ORCID is a nonprofit organization helping create a world in which all who participate in research, scholarship and innovation are uniquely identified and connected to their contributions and affiliations, across disciplines, borders, and time. (from their website)
However, not everybody is aware of this initiative or cares enough to set up an ORCID for themselves. Some journals request ORCIDs upon submission, e.g. for Nature Methods each Corresponding authors needs to have an ORCID.
The problem with using other information to identify researcher, is that this information can change as opposed to a uniquely assigned number.
This is exactly what ORCID tries to achieve:
ORCID is a nonprofit organization helping create a world in which all who participate in research, scholarship and innovation are uniquely identified and connected to their contributions and affiliations, across disciplines, borders, and time. (from their website)
However, not everybody is aware of this initiative or cares enough to set up an ORCID for themselves. Some journals request ORCIDs upon submission, e.g. for Nature Methods each Corresponding authors needs to have an ORCID.
The problem with using other information to identify researcher, is that this information can change as opposed to a uniquely assigned number.
edited Dec 18 '18 at 6:34
answered Dec 17 '18 at 9:20
L_WL_W
736110
736110
1
I am not sure it's required. It might be encouraged. For example: nature.com/articles/s41592-018-0187-8
– burger
Dec 17 '18 at 23:46
From the link in my answer: 'As part of our efforts to improve transparency in authorship, we request that all corresponding authors of published papers provide their Open Researcher and Contributor Identifier (ORCID) ID, before resubmitting the final version of the manuscript'. I don't know if this applies to all journal of the nature publishing group. And I also can't say how strictly this rule is enforced. But thanks, I changed require to request!
– L_W
Dec 18 '18 at 6:34
add a comment |
1
I am not sure it's required. It might be encouraged. For example: nature.com/articles/s41592-018-0187-8
– burger
Dec 17 '18 at 23:46
From the link in my answer: 'As part of our efforts to improve transparency in authorship, we request that all corresponding authors of published papers provide their Open Researcher and Contributor Identifier (ORCID) ID, before resubmitting the final version of the manuscript'. I don't know if this applies to all journal of the nature publishing group. And I also can't say how strictly this rule is enforced. But thanks, I changed require to request!
– L_W
Dec 18 '18 at 6:34
1
1
I am not sure it's required. It might be encouraged. For example: nature.com/articles/s41592-018-0187-8
– burger
Dec 17 '18 at 23:46
I am not sure it's required. It might be encouraged. For example: nature.com/articles/s41592-018-0187-8
– burger
Dec 17 '18 at 23:46
From the link in my answer: 'As part of our efforts to improve transparency in authorship, we request that all corresponding authors of published papers provide their Open Researcher and Contributor Identifier (ORCID) ID, before resubmitting the final version of the manuscript'. I don't know if this applies to all journal of the nature publishing group. And I also can't say how strictly this rule is enforced. But thanks, I changed require to request!
– L_W
Dec 18 '18 at 6:34
From the link in my answer: 'As part of our efforts to improve transparency in authorship, we request that all corresponding authors of published papers provide their Open Researcher and Contributor Identifier (ORCID) ID, before resubmitting the final version of the manuscript'. I don't know if this applies to all journal of the nature publishing group. And I also can't say how strictly this rule is enforced. But thanks, I changed require to request!
– L_W
Dec 18 '18 at 6:34
add a comment |
Thanks for contributing an answer to Academia Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2facademia.stackexchange.com%2fquestions%2f121740%2fhow-are-paper-authors-uniquely-identified%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
10
‘Harvesting papers’ - what, really, are you trying to do?
– Jon Custer
Dec 17 '18 at 13:17
@Jon Custer Get the metadata of papers provided by different sources .
– Agelos
Dec 17 '18 at 17:30
9
See J. Pfeffer & J. Pfeffer, "Another Article that Makes Bibliometric Analysis a Bit Harder," SIGBOVIK 2015, pp. 79-82. April 1, 2015. *Note: The specific date within that year is not a coincidence; it's a humor venue. PDF page numbers are +8.
– WBT
Dec 17 '18 at 18:27
Out of curiosity: Have you though of different spellings of the same name yet? E.g. the german family name Schröder might appear as Schröder (umlauts are available), Schroeder ("german" alternative when umlauts are not available) or Schroder ("scandinavian" alternative).
– Sabine
Dec 17 '18 at 18:33
3
Related: How are scientific papers uniquely identified?
– BlueRaja - Danny Pflughoeft
Dec 17 '18 at 18:36