abstract (explore) | nanodash

Login with ORCID

abstract

Full identifier: http://purl.org/dc/terms/abstract

Used in 2 templates:

Describing core article metadata - by Tobias Kuhn, 2024-11-19
Defining a FAIR Connect Thumbnail Article - by Barbara Magagna, 2023-01-10

References

Nanopublication	Part	Subject	Predicate	Object	Published By	Published On
https://w3id.org/np/RA48p4Ct8t... RA48p4Ct8t	links a nanopublication to its assertion http://www.nanopub.org/nschema#hasAssertion assertion	abstract	http://www.w3.org/2000/01/rdf-schema#label label	has the abstract	Tobias Kuhn	2024-11-19T13:45:55.152Z
https://w3id.org/kpxl/ios/ds/np/RAn08NC9is... RAn08NC9is	links a nanopublication to its assertion http://www.nanopub.org/nschema#hasAssertion assertion	https://doi.org/10.3233/DS-240059 doi:10.3233/DS-240059	abstract	Measuring data drift is essential in machine learning applications where model scoring (evaluation) is done on data samples that differ from those used in training. The Kullback-Leibler divergence is a common measure of shifted probability distributions, for which discretized versions are invented to deal with binned or categorical data. We present the Unstable Population Indicator, a robust, flexible and numerically stable, discretized implementation of Jeffrey's divergence, along with an implementation in a Python package that can deal with continuous, discrete, ordinal and nominal data in a variety of popular data types. We show the numerical and statistical properties in controlled experiments. It is not advised to employ a common cut-off to distinguish stable from unstable populations, but rather to let that cut-off depend on the use case.	Tobias Kuhn	2024-07-12T09:07:29.273Z
https://w3id.org/kpxl/ios/ds/np/RAZbl5xPyu... RAZbl5xPyu	links a nanopublication to its assertion http://www.nanopub.org/nschema#hasAssertion assertion	https://doi.org/10.3233/DS-220058 doi:10.3233/DS-220058	abstract	Sarcasm is a linguistic phenomenon often indicating a disparity between literal and inferred meanings. Due to its complexity, it is typically difficult to discern it within an online text message. Consequently, in recent years sarcasm detection has received considerable attention from both academia and industry. Nevertheless, the majority of current approaches simply model low-level indicators of sarcasm in various machine learning algorithms. This paper aims to present sarcasm in a new light by utilizing novel indicators in a deep weighted average ensemble-based framework (DWAEF). The novel indicators pertain to exploiting the presence of simile and metaphor in text and detecting the subtle shift in tone at a sentence’s structural level. A graph neural network (GNN) structure is implemented to detect the presence of simile, bidirectional encoder representations from transformers (BERT) embeddings are exploited to detect metaphorical instances and fuzzy logic is employed to account for the shift of tone. To account for the existence of sarcasm, the DWAEF integrates the inputs from the novel indicators. The performance of the framework is evaluated on a self-curated dataset of online text messages. A comparative report between the results acquired using primitive features and those obtained using a combination of primitive features and proposed indicators is provided. The highest accuracy of 92% was achieved after applying DWAEF, the proposed framework which combines the primitive features and novel indicators together as compared to 78.58% obtained using Support Vector Machine (SVM) which was the lowest among all classifiers.	Tobias Kuhn	2024-07-12T07:02:55.649Z
https://w3id.org/np/RAyW5v4w76... RAyW5v4w76	links a nanopublication to its assertion http://www.nanopub.org/nschema#hasAssertion assertion	https://doi.org/10.46471/gigabyte.99 doi:10.46471/gigabyte.99	abstract	In China, 65 types of venomous snakes exist, with the Chinese Cobra Naja atra being prominent and a major cause of snakebites in humans. Furthermore, N. atra is a protected animal in some areas, as it has been listed as vulnerable by the International Union for Conservation of Nature. Recently, due to the medical value of snake venoms, venomics has experienced growing research interest. In particular, genomic resources are crucial for understanding the molecular mechanisms of venom production. Here, we report a highly continuous genome assembly of N. atra, based on a snake sample from Huangshan, Anhui, China. The size of this genome is 1.67 Gb, while its repeat content constitutes 37.8% of the genome. A total of 26,432 functional genes were annotated. This data provides an essential resource for studying venom production in N. atra. It may also provide guidance for the protection of this species.	Scott C Edmunds	2023-12-05T03:26:23.359Z
https://w3id.org/np/RAi0yZT6cY... RAi0yZT6cY	links a nanopublication to its assertion http://www.nanopub.org/nschema#hasAssertion assertion	https://doi.org/10.1109/access.2023.3269660 doi:10.1109/access.2023.3269660	abstract	Topic modeling comprises a set of machine learning algorithms that allow topics to be extracted from a collection of documents. These algorithms have been widely used in many areas, such as identifying dominant topics in scientific research. However, works addressing such problems focus on identifying static topics, providing snapshots that cannot show how those topics evolve. Aiming to close this gap, in this article, we describe an approach for dynamic article set analysis and classification. This is accomplished by querying open data of notable scientific databases via representational state transfers. After that, we enforce data management practices with a dynamic topic modeling approach on the associated metadata available. As a result, we identify research trends for a given field at specific instants and the referred terminology trends evolution throughout the years. It was possible to detect the associated lexical variation over time in published content, ultimately determining the so-called “hot topics” in arbitrary instants and how they correlate.	Tobias Kuhn	2023-11-24T11:26:16.582Z
https://w3id.org/np/RAA0PugeMk... RAA0PugeMk	links a nanopublication to its assertion http://www.nanopub.org/nschema#hasAssertion assertion	https://doi.org/10.1109/access.2023.3269660 doi:10.1109/access.2023.3269660	abstract	Topic modeling comprises a set of machine learning algorithms that allow topics to be extracted from a collection of documents. These algorithms have been widely used in many areas, such as identifying dominant topics in scientific research. However, works addressing such problems focus on identifying static topics, providing snapshots that cannot show how those topics evolve. Aiming to close this gap, in this article, we describe an approach for dynamic article set analysis and classification. This is accomplished by querying open data of notable scientific databases via representational state transfers. After that, we enforce data management practices with a dynamic topic modeling approach on the associated metadata available. As a result, we identify research trends for a given field at specific instants and the referred terminology trends evolution throughout the years. It was possible to detect the associated lexical variation over time in published content, ultimately determining the so-called “hot topics” in arbitrary instants and how they correlate.	(unknown)	2023-11-22T19:26:28.402Z
http://purl.org/np/RA_VrHwZaW... RA_VrHwZaW	links a nanopublication to its assertion http://www.nanopub.org/nschema#hasAssertion assertion	https://doi.org/10.3233/DS-230058 doi:10.3233/DS-230058	abstract	Sarcasm is a linguistic phenomenon often indicating a disparity between literal and inferred meanings. Due to its complexity, it is typically difficult to discern it within an online text message. Consequently, in recent years sarcasm detection has received considerable attention from both academia and industry. Nevertheless, the majority of current approaches simply model low-level indicators of sarcasm in various machine learning algorithms. This paper aims to present sarcasm in a new light by utilizing novel indicators in a deep weighted average ensemble-based framework (DWAEF). The novel indicators pertain to exploiting the presence of simile and metaphor in text and detecting the subtle shift in tone at a sentence’s structural level. A graph neural network (GNN) structure is implemented to detect the presence of simile, bidirectional encoder representations from transformers (BERT) embeddings are exploited to detect metaphorical instances and fuzzy logic is employed to account for the shift of tone. To account for the existence of sarcasm, the DWAEF integrates the inputs from the novel indicators. The performance of the framework is evaluated on a self-curated dataset of online text messages. A comparative report between the results acquired using primitive features and those obtained using a combination of primitive features and proposed indicators is provided. The highest accuracy of 92% was achieved after applying DWAEF, the proposed framework which combines the primitive features and novel indicators together as compared to 78.58% obtained using Support Vector Machine (SVM) which was the lowest among all classifiers.	Tobias Kuhn	2023-07-21T14:01:37.941Z
http://purl.org/np/RAyE_qJYzZ... RAyE_qJYzZ	links a nanopublication to its assertion http://www.nanopub.org/nschema#hasAssertion assertion	http://purl.org/np/RAyE_qJYzZ...#DSP4FAIR DSP4FAIR	abstract	Data Stewardship Plan (DSP) templates prompt users to consider various issues but typically have no requirements for actual implementation choices. But as FAIR methodologies mature the DSP will become a more directive “how to” manual for making data FAIR.	Erik Schultes	2023-01-10T15:20:14.927Z
http://purl.org/np/RAeqKfXKi6... RAeqKfXKi6	links a nanopublication to its assertion http://www.nanopub.org/nschema#hasAssertion assertion	abstract	http://www.w3.org/2000/01/rdf-schema#label label	has abstract	Barbara Magagna	2023-01-10T14:07:49.629Z
http://purl.org/np/RA8XaAdN1X... RA8XaAdN1X	links a nanopublication to its assertion http://www.nanopub.org/nschema#hasAssertion assertion	http://purl.org/np/RA8XaAdN1X...#DSP4FAIR DSP4FAIR	abstract	Data Stewardship Plan (DSP) templates prompt users to consider various issues but typically have no requirements for actual implementation choices. But as FAIR methodologies mature the DSP will become a more directive “how to” manual for making data FAIR.	Erik Schultes	2023-01-10T08:53:03.053Z