How are paper authors uniquely identified?












12















Some authors have an ORCID ID in order to be identified and distinguished in case of authors with similar name, change of name, different name format, etc. However, some sources don’t provide the ORCID ID of the author (if any exists), which causes a lot of problems when someone tries to harvest papers from websites with scholarly resources.



I thought that a combination of some author features such as name, email, and affiliation could be enough to distinguish the authors, but I don’t think this is a robust solution.



Is there any way to uniquely identify every author?










share|improve this question




















  • 10





    ‘Harvesting papers’ - what, really, are you trying to do?

    – Jon Custer
    Dec 17 '18 at 13:17











  • @Jon Custer Get the metadata of papers provided by different sources .

    – Agelos
    Dec 17 '18 at 17:30






  • 9





    See J. Pfeffer & J. Pfeffer, "Another Article that Makes Bibliometric Analysis a Bit Harder," SIGBOVIK 2015, pp. 79-82. April 1, 2015. *Note: The specific date within that year is not a coincidence; it's a humor venue. PDF page numbers are +8.

    – WBT
    Dec 17 '18 at 18:27













  • Out of curiosity: Have you though of different spellings of the same name yet? E.g. the german family name Schröder might appear as Schröder (umlauts are available), Schroeder ("german" alternative when umlauts are not available) or Schroder ("scandinavian" alternative).

    – Sabine
    Dec 17 '18 at 18:33






  • 3





    Related: How are scientific papers uniquely identified?

    – BlueRaja - Danny Pflughoeft
    Dec 17 '18 at 18:36
















12















Some authors have an ORCID ID in order to be identified and distinguished in case of authors with similar name, change of name, different name format, etc. However, some sources don’t provide the ORCID ID of the author (if any exists), which causes a lot of problems when someone tries to harvest papers from websites with scholarly resources.



I thought that a combination of some author features such as name, email, and affiliation could be enough to distinguish the authors, but I don’t think this is a robust solution.



Is there any way to uniquely identify every author?










share|improve this question




















  • 10





    ‘Harvesting papers’ - what, really, are you trying to do?

    – Jon Custer
    Dec 17 '18 at 13:17











  • @Jon Custer Get the metadata of papers provided by different sources .

    – Agelos
    Dec 17 '18 at 17:30






  • 9





    See J. Pfeffer & J. Pfeffer, "Another Article that Makes Bibliometric Analysis a Bit Harder," SIGBOVIK 2015, pp. 79-82. April 1, 2015. *Note: The specific date within that year is not a coincidence; it's a humor venue. PDF page numbers are +8.

    – WBT
    Dec 17 '18 at 18:27













  • Out of curiosity: Have you though of different spellings of the same name yet? E.g. the german family name Schröder might appear as Schröder (umlauts are available), Schroeder ("german" alternative when umlauts are not available) or Schroder ("scandinavian" alternative).

    – Sabine
    Dec 17 '18 at 18:33






  • 3





    Related: How are scientific papers uniquely identified?

    – BlueRaja - Danny Pflughoeft
    Dec 17 '18 at 18:36














12












12








12


2






Some authors have an ORCID ID in order to be identified and distinguished in case of authors with similar name, change of name, different name format, etc. However, some sources don’t provide the ORCID ID of the author (if any exists), which causes a lot of problems when someone tries to harvest papers from websites with scholarly resources.



I thought that a combination of some author features such as name, email, and affiliation could be enough to distinguish the authors, but I don’t think this is a robust solution.



Is there any way to uniquely identify every author?










share|improve this question
















Some authors have an ORCID ID in order to be identified and distinguished in case of authors with similar name, change of name, different name format, etc. However, some sources don’t provide the ORCID ID of the author (if any exists), which causes a lot of problems when someone tries to harvest papers from websites with scholarly resources.



I thought that a combination of some author features such as name, email, and affiliation could be enough to distinguish the authors, but I don’t think this is a robust solution.



Is there any way to uniquely identify every author?







publications digital-researcher-id






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Dec 17 '18 at 20:08









TRiG

378215




378215










asked Dec 17 '18 at 9:05









AgelosAgelos

17227




17227








  • 10





    ‘Harvesting papers’ - what, really, are you trying to do?

    – Jon Custer
    Dec 17 '18 at 13:17











  • @Jon Custer Get the metadata of papers provided by different sources .

    – Agelos
    Dec 17 '18 at 17:30






  • 9





    See J. Pfeffer & J. Pfeffer, "Another Article that Makes Bibliometric Analysis a Bit Harder," SIGBOVIK 2015, pp. 79-82. April 1, 2015. *Note: The specific date within that year is not a coincidence; it's a humor venue. PDF page numbers are +8.

    – WBT
    Dec 17 '18 at 18:27













  • Out of curiosity: Have you though of different spellings of the same name yet? E.g. the german family name Schröder might appear as Schröder (umlauts are available), Schroeder ("german" alternative when umlauts are not available) or Schroder ("scandinavian" alternative).

    – Sabine
    Dec 17 '18 at 18:33






  • 3





    Related: How are scientific papers uniquely identified?

    – BlueRaja - Danny Pflughoeft
    Dec 17 '18 at 18:36














  • 10





    ‘Harvesting papers’ - what, really, are you trying to do?

    – Jon Custer
    Dec 17 '18 at 13:17











  • @Jon Custer Get the metadata of papers provided by different sources .

    – Agelos
    Dec 17 '18 at 17:30






  • 9





    See J. Pfeffer & J. Pfeffer, "Another Article that Makes Bibliometric Analysis a Bit Harder," SIGBOVIK 2015, pp. 79-82. April 1, 2015. *Note: The specific date within that year is not a coincidence; it's a humor venue. PDF page numbers are +8.

    – WBT
    Dec 17 '18 at 18:27













  • Out of curiosity: Have you though of different spellings of the same name yet? E.g. the german family name Schröder might appear as Schröder (umlauts are available), Schroeder ("german" alternative when umlauts are not available) or Schroder ("scandinavian" alternative).

    – Sabine
    Dec 17 '18 at 18:33






  • 3





    Related: How are scientific papers uniquely identified?

    – BlueRaja - Danny Pflughoeft
    Dec 17 '18 at 18:36








10




10





‘Harvesting papers’ - what, really, are you trying to do?

– Jon Custer
Dec 17 '18 at 13:17





‘Harvesting papers’ - what, really, are you trying to do?

– Jon Custer
Dec 17 '18 at 13:17













@Jon Custer Get the metadata of papers provided by different sources .

– Agelos
Dec 17 '18 at 17:30





@Jon Custer Get the metadata of papers provided by different sources .

– Agelos
Dec 17 '18 at 17:30




9




9





See J. Pfeffer & J. Pfeffer, "Another Article that Makes Bibliometric Analysis a Bit Harder," SIGBOVIK 2015, pp. 79-82. April 1, 2015. *Note: The specific date within that year is not a coincidence; it's a humor venue. PDF page numbers are +8.

– WBT
Dec 17 '18 at 18:27







See J. Pfeffer & J. Pfeffer, "Another Article that Makes Bibliometric Analysis a Bit Harder," SIGBOVIK 2015, pp. 79-82. April 1, 2015. *Note: The specific date within that year is not a coincidence; it's a humor venue. PDF page numbers are +8.

– WBT
Dec 17 '18 at 18:27















Out of curiosity: Have you though of different spellings of the same name yet? E.g. the german family name Schröder might appear as Schröder (umlauts are available), Schroeder ("german" alternative when umlauts are not available) or Schroder ("scandinavian" alternative).

– Sabine
Dec 17 '18 at 18:33





Out of curiosity: Have you though of different spellings of the same name yet? E.g. the german family name Schröder might appear as Schröder (umlauts are available), Schroeder ("german" alternative when umlauts are not available) or Schroder ("scandinavian" alternative).

– Sabine
Dec 17 '18 at 18:33




3




3





Related: How are scientific papers uniquely identified?

– BlueRaja - Danny Pflughoeft
Dec 17 '18 at 18:36





Related: How are scientific papers uniquely identified?

– BlueRaja - Danny Pflughoeft
Dec 17 '18 at 18:36










2 Answers
2






active

oldest

votes


















24














Aside from ORCID (which by far not every paper and person has), there really is no sure-fire way to uniquely identify an author. Using the name becomes problematic with common names (not unusual anywhere in the world, but a particularly common issue in Asia) or name changes (for instance in case of marriage). Combining with affiliation and e-mail address will also only get you so far as most academics tend to change universities at least once or twice in their career, and both affiliation and e-mail address tend to change in these cases.



For bibliographic research, the most promising approach is probably to combine all of the above with field information (e.g., a Markus Huber publishing in medicine is not particularly likely to be the same as a Markus Huber publishing in philosophy), and train some sort of heuristic classifier. Clearly, false positives/negatives will happen, but if your goal is to holistically assess a larger field of research a few false categorizations are unlikely to impact the overall picture too much.



If your goal is to assess an individual researcher, really the most accurate information is usually to trust what information the researchers themselves maintain (e.g., a CV or publicly available publication list).






share|improve this answer































    8














    This is exactly what ORCID tries to achieve:




    ORCID is a nonprofit organization helping create a world in which all who participate in research, scholarship and innovation are uniquely identified and connected to their contributions and affiliations, across disciplines, borders, and time. (from their website)




    However, not everybody is aware of this initiative or cares enough to set up an ORCID for themselves. Some journals request ORCIDs upon submission, e.g. for Nature Methods each Corresponding authors needs to have an ORCID.
    The problem with using other information to identify researcher, is that this information can change as opposed to a uniquely assigned number.






    share|improve this answer





















    • 1





      I am not sure it's required. It might be encouraged. For example: nature.com/articles/s41592-018-0187-8

      – burger
      Dec 17 '18 at 23:46











    • From the link in my answer: 'As part of our efforts to improve transparency in authorship, we request that all corresponding authors of published papers provide their Open Researcher and Contributor Identifier (ORCID) ID, before resubmitting the final version of the manuscript'. I don't know if this applies to all journal of the nature publishing group. And I also can't say how strictly this rule is enforced. But thanks, I changed require to request!

      – L_W
      Dec 18 '18 at 6:34













    Your Answer








    StackExchange.ready(function() {
    var channelOptions = {
    tags: "".split(" "),
    id: "415"
    };
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function() {
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled) {
    StackExchange.using("snippets", function() {
    createEditor();
    });
    }
    else {
    createEditor();
    }
    });

    function createEditor() {
    StackExchange.prepareEditor({
    heartbeatType: 'answer',
    autoActivateHeartbeat: false,
    convertImagesToLinks: true,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: 10,
    bindNavPrevention: true,
    postfix: "",
    imageUploader: {
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    },
    noCode: true, onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    });


    }
    });














    draft saved

    draft discarded


















    StackExchange.ready(
    function () {
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2facademia.stackexchange.com%2fquestions%2f121740%2fhow-are-paper-authors-uniquely-identified%23new-answer', 'question_page');
    }
    );

    Post as a guest















    Required, but never shown

























    2 Answers
    2






    active

    oldest

    votes








    2 Answers
    2






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    24














    Aside from ORCID (which by far not every paper and person has), there really is no sure-fire way to uniquely identify an author. Using the name becomes problematic with common names (not unusual anywhere in the world, but a particularly common issue in Asia) or name changes (for instance in case of marriage). Combining with affiliation and e-mail address will also only get you so far as most academics tend to change universities at least once or twice in their career, and both affiliation and e-mail address tend to change in these cases.



    For bibliographic research, the most promising approach is probably to combine all of the above with field information (e.g., a Markus Huber publishing in medicine is not particularly likely to be the same as a Markus Huber publishing in philosophy), and train some sort of heuristic classifier. Clearly, false positives/negatives will happen, but if your goal is to holistically assess a larger field of research a few false categorizations are unlikely to impact the overall picture too much.



    If your goal is to assess an individual researcher, really the most accurate information is usually to trust what information the researchers themselves maintain (e.g., a CV or publicly available publication list).






    share|improve this answer




























      24














      Aside from ORCID (which by far not every paper and person has), there really is no sure-fire way to uniquely identify an author. Using the name becomes problematic with common names (not unusual anywhere in the world, but a particularly common issue in Asia) or name changes (for instance in case of marriage). Combining with affiliation and e-mail address will also only get you so far as most academics tend to change universities at least once or twice in their career, and both affiliation and e-mail address tend to change in these cases.



      For bibliographic research, the most promising approach is probably to combine all of the above with field information (e.g., a Markus Huber publishing in medicine is not particularly likely to be the same as a Markus Huber publishing in philosophy), and train some sort of heuristic classifier. Clearly, false positives/negatives will happen, but if your goal is to holistically assess a larger field of research a few false categorizations are unlikely to impact the overall picture too much.



      If your goal is to assess an individual researcher, really the most accurate information is usually to trust what information the researchers themselves maintain (e.g., a CV or publicly available publication list).






      share|improve this answer


























        24












        24








        24







        Aside from ORCID (which by far not every paper and person has), there really is no sure-fire way to uniquely identify an author. Using the name becomes problematic with common names (not unusual anywhere in the world, but a particularly common issue in Asia) or name changes (for instance in case of marriage). Combining with affiliation and e-mail address will also only get you so far as most academics tend to change universities at least once or twice in their career, and both affiliation and e-mail address tend to change in these cases.



        For bibliographic research, the most promising approach is probably to combine all of the above with field information (e.g., a Markus Huber publishing in medicine is not particularly likely to be the same as a Markus Huber publishing in philosophy), and train some sort of heuristic classifier. Clearly, false positives/negatives will happen, but if your goal is to holistically assess a larger field of research a few false categorizations are unlikely to impact the overall picture too much.



        If your goal is to assess an individual researcher, really the most accurate information is usually to trust what information the researchers themselves maintain (e.g., a CV or publicly available publication list).






        share|improve this answer













        Aside from ORCID (which by far not every paper and person has), there really is no sure-fire way to uniquely identify an author. Using the name becomes problematic with common names (not unusual anywhere in the world, but a particularly common issue in Asia) or name changes (for instance in case of marriage). Combining with affiliation and e-mail address will also only get you so far as most academics tend to change universities at least once or twice in their career, and both affiliation and e-mail address tend to change in these cases.



        For bibliographic research, the most promising approach is probably to combine all of the above with field information (e.g., a Markus Huber publishing in medicine is not particularly likely to be the same as a Markus Huber publishing in philosophy), and train some sort of heuristic classifier. Clearly, false positives/negatives will happen, but if your goal is to holistically assess a larger field of research a few false categorizations are unlikely to impact the overall picture too much.



        If your goal is to assess an individual researcher, really the most accurate information is usually to trust what information the researchers themselves maintain (e.g., a CV or publicly available publication list).







        share|improve this answer












        share|improve this answer



        share|improve this answer










        answered Dec 17 '18 at 9:18









        xLeitixxLeitix

        99.6k35239381




        99.6k35239381























            8














            This is exactly what ORCID tries to achieve:




            ORCID is a nonprofit organization helping create a world in which all who participate in research, scholarship and innovation are uniquely identified and connected to their contributions and affiliations, across disciplines, borders, and time. (from their website)




            However, not everybody is aware of this initiative or cares enough to set up an ORCID for themselves. Some journals request ORCIDs upon submission, e.g. for Nature Methods each Corresponding authors needs to have an ORCID.
            The problem with using other information to identify researcher, is that this information can change as opposed to a uniquely assigned number.






            share|improve this answer





















            • 1





              I am not sure it's required. It might be encouraged. For example: nature.com/articles/s41592-018-0187-8

              – burger
              Dec 17 '18 at 23:46











            • From the link in my answer: 'As part of our efforts to improve transparency in authorship, we request that all corresponding authors of published papers provide their Open Researcher and Contributor Identifier (ORCID) ID, before resubmitting the final version of the manuscript'. I don't know if this applies to all journal of the nature publishing group. And I also can't say how strictly this rule is enforced. But thanks, I changed require to request!

              – L_W
              Dec 18 '18 at 6:34


















            8














            This is exactly what ORCID tries to achieve:




            ORCID is a nonprofit organization helping create a world in which all who participate in research, scholarship and innovation are uniquely identified and connected to their contributions and affiliations, across disciplines, borders, and time. (from their website)




            However, not everybody is aware of this initiative or cares enough to set up an ORCID for themselves. Some journals request ORCIDs upon submission, e.g. for Nature Methods each Corresponding authors needs to have an ORCID.
            The problem with using other information to identify researcher, is that this information can change as opposed to a uniquely assigned number.






            share|improve this answer





















            • 1





              I am not sure it's required. It might be encouraged. For example: nature.com/articles/s41592-018-0187-8

              – burger
              Dec 17 '18 at 23:46











            • From the link in my answer: 'As part of our efforts to improve transparency in authorship, we request that all corresponding authors of published papers provide their Open Researcher and Contributor Identifier (ORCID) ID, before resubmitting the final version of the manuscript'. I don't know if this applies to all journal of the nature publishing group. And I also can't say how strictly this rule is enforced. But thanks, I changed require to request!

              – L_W
              Dec 18 '18 at 6:34
















            8












            8








            8







            This is exactly what ORCID tries to achieve:




            ORCID is a nonprofit organization helping create a world in which all who participate in research, scholarship and innovation are uniquely identified and connected to their contributions and affiliations, across disciplines, borders, and time. (from their website)




            However, not everybody is aware of this initiative or cares enough to set up an ORCID for themselves. Some journals request ORCIDs upon submission, e.g. for Nature Methods each Corresponding authors needs to have an ORCID.
            The problem with using other information to identify researcher, is that this information can change as opposed to a uniquely assigned number.






            share|improve this answer















            This is exactly what ORCID tries to achieve:




            ORCID is a nonprofit organization helping create a world in which all who participate in research, scholarship and innovation are uniquely identified and connected to their contributions and affiliations, across disciplines, borders, and time. (from their website)




            However, not everybody is aware of this initiative or cares enough to set up an ORCID for themselves. Some journals request ORCIDs upon submission, e.g. for Nature Methods each Corresponding authors needs to have an ORCID.
            The problem with using other information to identify researcher, is that this information can change as opposed to a uniquely assigned number.







            share|improve this answer














            share|improve this answer



            share|improve this answer








            edited Dec 18 '18 at 6:34

























            answered Dec 17 '18 at 9:20









            L_WL_W

            736110




            736110








            • 1





              I am not sure it's required. It might be encouraged. For example: nature.com/articles/s41592-018-0187-8

              – burger
              Dec 17 '18 at 23:46











            • From the link in my answer: 'As part of our efforts to improve transparency in authorship, we request that all corresponding authors of published papers provide their Open Researcher and Contributor Identifier (ORCID) ID, before resubmitting the final version of the manuscript'. I don't know if this applies to all journal of the nature publishing group. And I also can't say how strictly this rule is enforced. But thanks, I changed require to request!

              – L_W
              Dec 18 '18 at 6:34
















            • 1





              I am not sure it's required. It might be encouraged. For example: nature.com/articles/s41592-018-0187-8

              – burger
              Dec 17 '18 at 23:46











            • From the link in my answer: 'As part of our efforts to improve transparency in authorship, we request that all corresponding authors of published papers provide their Open Researcher and Contributor Identifier (ORCID) ID, before resubmitting the final version of the manuscript'. I don't know if this applies to all journal of the nature publishing group. And I also can't say how strictly this rule is enforced. But thanks, I changed require to request!

              – L_W
              Dec 18 '18 at 6:34










            1




            1





            I am not sure it's required. It might be encouraged. For example: nature.com/articles/s41592-018-0187-8

            – burger
            Dec 17 '18 at 23:46





            I am not sure it's required. It might be encouraged. For example: nature.com/articles/s41592-018-0187-8

            – burger
            Dec 17 '18 at 23:46













            From the link in my answer: 'As part of our efforts to improve transparency in authorship, we request that all corresponding authors of published papers provide their Open Researcher and Contributor Identifier (ORCID) ID, before resubmitting the final version of the manuscript'. I don't know if this applies to all journal of the nature publishing group. And I also can't say how strictly this rule is enforced. But thanks, I changed require to request!

            – L_W
            Dec 18 '18 at 6:34







            From the link in my answer: 'As part of our efforts to improve transparency in authorship, we request that all corresponding authors of published papers provide their Open Researcher and Contributor Identifier (ORCID) ID, before resubmitting the final version of the manuscript'. I don't know if this applies to all journal of the nature publishing group. And I also can't say how strictly this rule is enforced. But thanks, I changed require to request!

            – L_W
            Dec 18 '18 at 6:34




















            draft saved

            draft discarded




















































            Thanks for contributing an answer to Academia Stack Exchange!


            • Please be sure to answer the question. Provide details and share your research!

            But avoid



            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.


            To learn more, see our tips on writing great answers.




            draft saved


            draft discarded














            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2facademia.stackexchange.com%2fquestions%2f121740%2fhow-are-paper-authors-uniquely-identified%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            Popular posts from this blog

            Bressuire

            Cabo Verde

            Gyllenstierna