PRJEB41370 - Name - OmicsDI

Browse
Submit Data
Databases
API
Help

Genomics

Dataset Information

0 Views

0 Connections

0 Citations

0 Reanalyses

0 Downloads

Omics score: 0

Name

ABSTRACT: title

PROVIDER: PRJEB41370 | ENA |

REPOSITORIES: ENA

Similar Datasets

Project description:Study Title

| PRJEB42460 | ENA

Project description:Study Title

| PRJEB44843 | ENA

Project description:Test Study title

| PRJEB44939 | ENA

Project description:Test Study title

| PRJEB44938 | ENA

Project description:Temporary Short Descriptive Study Title

| PRJEB55127 | ENA

Project description:Title of first study

| PRJEB32895 | ENA

Study name test bla

Project description:Short descriptive study title test

| PRJEB74323 | ENA

provide-a-short-name-for-the-study:

Project description:Please provide a short descriptive title for the study

| PRJEB19333 | ENA

Project description:Umbrella child test

| PRJEB44969 | ENA

Ethnicity-based name partitioning for author name disambiguation using supervised machine learning.

Project description:In several author name disambiguation studies, some ethnic name groups such as East Asian names are reported to be more difficult to disambiguate than others. This implies that disambiguation approaches might be improved if ethnic name groups are distinguished before disambiguation. We explore the potential of ethnic name partitioning by comparing performance of four machine learning algorithms trained and tested on the entire data or specifically on individual name groups. Results show that ethnicity-based name partitioning can substantially improve disambiguation performance because the individual models are better suited for their respective name group. The improvements occur across all ethnic name groups with different magnitudes. Performance gains in predicting matched name pairs outweigh losses in predicting nonmatched pairs. Feature (e.g., coauthor name) similarities of name pairs vary across ethnic name groups. Such differences may enable the development of ethnicity-specific feature weights to improve prediction for specific ethic name categories. These findings are observed for three labeled data with a natural distribution of problem sizes as well as one in which all ethnic name groups are controlled for the same sizes of ambiguous names. This study is expected to motive scholars to group author names based on ethnicity prior to disambiguation.

| S-EPMC8359369 | biostudies-literature

OmicsDI is part of the ELIXIR infrastructure

OmicsDI is an Elixir interoperability service. Learn more ›

Tweets

OmicsDI Databases

PRIDE
PeptideAtlas
MassIVE
JPOST Repository
Physiome Model Repository

EGA
EVA
ENA
LINCS
PAXDB
Cell Collective

MetaboLights
Metabolomics Workbench
MetabolomeExpress
GNPS
BioModels
FAIRDOMHub

ArrayExpress
dbGaP
ExpressionAtlas
GEO
NODE

Information

Databases
Help
API
Contact us
Code on GitHub
Terms of use
Submit Data