References for: SAC-TD3Hybrid
Full identifier: https://w3id.org/np/RAgOeYsOZ9JMrf7eZXm9LfwjGMukSuzSpk6eeHFRpRlNo#SAC-TD3Hybrid
Nanopublication | Part | Subject | Predicate | Object | Published By | Published On |
---|---|---|---|---|---|---|
links a nanopublication to its assertion
http://www.nanopub.org/nschema#hasAssertion
assertion
|
SAC-TD3Hybrid
|
Tobias Kuhn
|
2024-10-04T09:20:40.120Z
|
|||
links a nanopublication to its assertion
http://www.nanopub.org/nschema#hasAssertion
assertion
|
SAC-TD3Hybrid
|
(unknown)
|
2024-10-02T09:51:13.183Z
|
|||
links a nanopublication to its assertion
http://www.nanopub.org/nschema#hasAssertion
assertion
|
SAC-TD3Hybrid
|
SAC-TD3 Hybrid
|
(unknown)
|
2024-10-02T09:51:13.183Z
|
||
links a nanopublication to its assertion
http://www.nanopub.org/nschema#hasAssertion
assertion
|
SAC-TD3Hybrid
|
(unknown)
|
2024-10-02T09:51:13.183Z
|
|||
links a nanopublication to its assertion
http://www.nanopub.org/nschema#hasAssertion
assertion
|
SAC-TD3Hybrid
|
SAC-TD3 is a reinforcement learning algorithm that combines elements of the SAC and TD3 algorithms. It takes entropy regularization from SAC, and target policy smoothing and delayed policy updates from TD3. It has been used to estimate the reaction barrier given a potential energy surface. Stable states in complex systems correspond to local minima on the associated potential energy surface, and transitions between these minima govern the dynamics of the system. Precisely determining the transition pathways in complex, high-dimensional systems is challenging because these transitions are rare events and the system remains near a local minimum most of the time. The probability of such transitions decreases exponentially with the height of the energy barrier, making the system's dynamics highly sensitive to the calculated energy barriers. This problem is formulated as a cost-minimization problem and solved using the aforementioned reinforcement learning algorithm. The exploratory nature of the algorithm enables efficient sampling and better estimation of the minimum energy barrier for transitions.
|
(unknown)
|
2024-10-02T09:51:13.183Z
|
||
links a nanopublication to its assertion
http://www.nanopub.org/nschema#hasAssertion
assertion
|
SAC-TD3Hybrid
|
(unknown)
|
2024-10-02T09:51:13.183Z
|
|||
links a nanopublication to its assertion
http://www.nanopub.org/nschema#hasAssertion
assertion
|
SAC-TD3Hybrid
|
(unknown)
|
2024-10-02T09:51:13.183Z
|
|||
links a nanopublication to its pubinfo
http://www.nanopub.org/nschema#hasPublicationInfo
pubinfo
|
SAC-TD3Hybrid
|
(unknown)
|
2024-10-02T09:51:13.183Z
|
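The description of SAC-TD3 in the table above names three ingredients: entropy regularization (from SAC), and target policy smoothing and delayed policy updates (from TD3). The sketch below shows one plausible way these pieces could fit together in the critic target. It is not taken from the nanopublication or the authors' implementation. All function names, parameter names, and default values (`sigma`, `noise_clip`, `alpha`, `policy_delay`) are assumptions chosen for illustration, and the clipped double-Q minimum is included only because both SAC and TD3 conventionally use it.

```python
import random


def smoothed_action(mu, sigma=0.2, noise_clip=0.5, low=-1.0, high=1.0, rng=random):
    """TD3-style target policy smoothing (assumed form).

    Adds clipped Gaussian noise to the target action, then clips the
    result to the valid action range [low, high].
    """
    noise = max(-noise_clip, min(noise_clip, rng.gauss(0.0, sigma)))
    return max(low, min(high, mu + noise))


def hybrid_target(reward, done, q1_next, q2_next, log_prob_next,
                  gamma=0.99, alpha=0.2):
    """Critic target with a SAC-style entropy bonus (assumed form).

    q1_next and q2_next are the two target-critic values at the smoothed
    next action; log_prob_next is the log-probability of that action
    under the current policy.
    """
    soft_value = min(q1_next, q2_next) - alpha * log_prob_next
    return reward + gamma * (1.0 - done) * soft_value


def should_update_actor(step, policy_delay=2):
    """TD3-style delayed policy update: refresh the actor only every
    `policy_delay` critic updates."""
    return step % policy_delay == 0
```

In a training loop built on this sketch, the critics would be regressed toward `hybrid_target(...)` at every step, while the actor and target networks would be updated only when `should_update_actor(step)` is true. For the barrier-estimation use case, the reward would presumably be the negative of the path cost, though the source does not specify its exact form.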