搜索到65086篇“ REINFORCEMENT“的相关文章
离线强化学习研究综述
2025年
离线强化学习也称为批量强化学习,是深度强化学习领域的一项重要研究内容。它利用行为策略生成静态数据集,无需在线和环境交互,成功地将大规模数据集转变成强大的决策引擎。近年来,离线强化学习方法得到了广泛关注和深入研究,并在实际应用中取得了瞩目的成绩。目前,该方法已经用于推荐系统、导航驾驶、自然语言处理、机器人控制以及医疗与能源等应用领域,并被看作是现实世界应用强化学习最具潜力的技术途径之一。该文首先介绍了离线强化学习的背景与理论基础。随后从求解思路出发,将离线强化学习方法分为无模型、基于模型和基于Transformer模型3大类,并对各类方法的研究现状与发展趋势进行分析。同时,对比了目前3个最流行的实验环境D4RL、RL Unplugged和NeoRL。进而介绍了离线强化学习技术在现实世界诸多领域的应用。最后,对离线强化学习进行总结与展望,以此推动更多该领域的研究工作。
乌兰刘全黄志刚张立华
关键词:人工智能
Reinforcement Learning in Mechatronic Systems: A Case Study on DC Motor Control
2025年
The integration of artificial intelligence into the development and production of mechatronic products offers a substantial opportunity to enhance efficiency, adaptability, and system performance. This paper examines the utilization of reinforcement learning as a control strategy, with a particular focus on its deployment in pivotal stages of the product development lifecycle, specifically between system architecture and system integration and verification. A controller based on reinforcement learning was developed and evaluated in comparison to traditional proportional-integral controllers in dynamic and fault-prone environments. The results illustrate the superior adaptability, stability, and optimization potential of the reinforcement learning approach, particularly in addressing dynamic disturbances and ensuring robust performance. The study illustrates how reinforcement learning can facilitate the transition from conceptual design to implementation by automating optimization processes, enabling interface automation, and enhancing system-level testing. Based on the aforementioned findings, this paper presents future directions for research, which include the integration of domain-specific knowledge into the reinforcement learning process and the validation of this process in real-world environments. The results underscore the potential of artificial intelligence-driven methodologies to revolutionize the design and deployment of intelligent mechatronic systems.
Alexander NüßgenAlexander LerchRené DegenMarcus IrmerMartin de FriesFabian RichterCecilia BoströmMargot Ruschitzka
某水库除险加固工程的主要病险及加固方案
2025年
为消除水库病害,防止水库在运行过程中发生险情,并使得水库能正常发挥效益,保护人民生命财产安全,为下游农田灌溉供水增加保障。通过现场踏勘、取样试验、计算复核,确定水库防洪不满足要求:大坝上、下游坝坡不满足抗震要求,溢洪道控制段和泄槽边墙高度不满足要求,消力池基本失去消能防冲功能;水库监测设施不完善,整个水库建筑物不能正常发挥作用。针对各建筑物提出来加固设计方案,为水库除险加固设计及施工提供了强有力的技术支撑和保障。
朱家俊陈俊宇
关键词:水库枢纽建筑物除险加固
Borehole reinforcement based on polymer materials induced by liquid-gas phase transition in simulating lunar coring
2025年
Lunar core samples are the key materials for accurately assessing and developing lunar resources.However,the difficulty of maintaining borehole stability in the lunar coring process limits the depth of lunar coring.Here,a strategy of using a reinforcement fluid that undergoes a phase transition spontaneously in a vacuum environment to reinforce the borehole is proposed.Based on this strategy,a reinforcement liquid suitable for a wide temperature range and a high vacuum environment was developed.A feasibility study on reinforcing the borehole with the reinforcement liquid was carried out,and it is found that the cohesion of the simulated lunar soil can be increased from 2 to 800 kPa after using the reinforcement liquid.Further,a series of coring experiments are conducted using a selfdeveloped high vacuum(vacuum degree of 5 Pa)and low-temperature(between-30 and 50℃)simulation platform.It is confirmed that the high-boiling-point reinforcement liquid pre-placed in the drill pipe can be released spontaneously during the drilling process and finally complete the reinforcement of the borehole.The reinforcement effect of the borehole is better when the solute concentration is between0.15 and 0.25 g/mL.
Dingqiang MoTao LiuZhiyu ZhaoLiangyu ZhuDongsheng YangYifan WuCheng LanWenchuan JiangHeping Xie
草本植物根系对土体的加筋作用
2025年
为研究根土相互作用,量化草本植物根系对土体抗剪强度的增强作用,以Wu和Waldron提出的根系固土模型为基础,考虑根系分布剪切方向、锚固作用长度和有效根数量3个因素,提出了改进模型,对草本植物加筋模型进行改进优化,并利用该模型研究根系与剪切面初始夹角、根系直径等对固土能力的影响。结果表明:改进模型能够很好地拟合试验结果;随着根系与剪切面夹角的增大,逆斜交根系对土体的加筋贡献值先增大后减小,顺斜交根系对土体的加筋贡献值先为0后增大最后减小;随着根系直径的增大,不同分布角度的根系对土体的加筋贡献值均增大,且大致呈正比例关系。
王海涛张宇刘琳琳崔明华沈向军
关键词:草本植物生态护坡根土复合体加筋作用
加固施工技术在建筑地基基础中的应用
2025年
做好地基基础加固技术的工艺控制,对保证建筑地基基础加固效果有很大的作用。基于此,文章依据卓然·铂金公馆(B区)工程项目为例,着重分析了地基基础加固施工中的注浆材料选择及制备,以及钻孔、注浆管道安装、注浆加固要点,以期为地基基础加固工程提供有益参考,提升建筑结构稳定性能。
王志华
关键词:建筑地基加固技术
Optimization of Intelligent Education Systems Based on Reinforcement Learning
2025年
This paper explores how reinforcement learning(RL)can improve intelligent education systems.RL helps make learning personal,flexible,and efficient by choosing actions based on student needs and rewards like better scores or engagement.We study its use in custom learning paths,smart testing,and teacher support,showing how it beats old methods that don’t adapt.The paper also suggests future ideas—like better RL tools,teamwork learning,and mixing RL with big language models—while noting fairness challenges.Using pretend data with 1000 students,we test RL’s power to plan learning step by step.Results show RL can lift learning by 2025%in areas like tutoring and class focus.This work gives a clear plan for using RL to make education smarter and fairer,pointing to a bright future for adaptive learning.
Sophia LI
一种进化梯度引导的强化学习算法
2025年
进化算法(Evolutionary Algorithm,EA)和深度强化学习(Deep Reinforcement Learning,DRL)的组合被认为能够结合二者的优点,即EA的强大随机搜索能力和DRL的样本效率,实现更好的策略学习。然而,现有的组合方法存在EA引入所导致的策略性能不可预测性问题。提出自适应历史梯度引导机制,其利用历史梯度信息,找到平衡探索和利用的线索,从而获得较为稳定的高质量策略,进一步将此机制融合经典的进化强化学习算法,提出一种进化梯度引导的强化学习算法(Evolutionary Gradient Guided Reinforcement Learning,EGG⁃RL)。在连续控制任务方面的实验表明,EGG⁃RL的性能表现优于其他方法。
许斌练元洪卞鸿根刘丹亓晋
关键词:进化算法
试析钢筋混凝土结构加固设计与应用
2025年
在新时期背景下,我国各地区建筑物需要进行改造与加固,尤其是钢筋混凝土结构房屋,这是因为钢筋混凝土结构长期负荷、材料老化,出现承载力下降、安全性降低等其他不良问题,如果不对钢筋混凝土结构进行加固与处理,将无法保障后续使用的安全性与可靠性。基于此,本文首先简单阐述了钢筋混凝土结构加固的重要性,其次总结加固设计存在的问题,最后提出具有科学性与有效性的加固方法,以期为钢筋混凝土结构加固设计工作有序开展提供参考。
陆秀娟
关键词:钢筋混凝土结构加固应用设计方法
Advances in the chemical modification of nanocellulose for biodegradable plastics reinforcement
2025年
As non-degradable traditional plastics contribute to environmental pollution,biodegradable polymers have been identified as a promising alternative.However,inherent drawbacks such as low toughness,poor tensile strength,and reduced thermal degradation temperatures limit the further development of biodegradable polymers.Nanocellulose has the potential to enhance the properties of biodegradable polymers without compromising their biodegradability.However,the abundant hydroxyl groups in nanocellulose’s molecular chains result in poor compatibility with hydrophobic polymers,requiring surface modification prior to their combination.This review first introduces several common biodegradable polymers and three types of nanocellulose,followed by a comprehensive analysis of the recent advancements in the chemical modification methods of nanocellulose over the last five years.These methods encompass esterification,oxidation,silylation,and graft modification.The focus of this discussion is primarily on the modification strategies,enhancement effects,and mechanisms.Furthermore,the degradability and applications of modified nanocellulose composites are summarized.Finally,the main challenges hindering the development of chemically modified nanocellulose-reinforced biodegradable polymers are proposed.It is hoped that this review will inspire future researchers to develop industrially valuable chemically modified nanocellulose-reinforced biodegradable polymers.
Shuya ZhangMingda CheRenliang HuangMei CuiWei QiRongxin Su
关键词:NANOCELLULOSEDEGRADABILITYAPPLICATION

相关作者

卜良桃
作品数:183被引量:438H指数:12
供职机构:湖南大学土木工程学院
研究主题:活性粉末混凝土 钢筋网 二次受力 RPC 钢筋混凝土梁
吴元
作品数:22被引量:38H指数:4
供职机构:东南大学
研究主题:体外预应力 体外预应力加固 双向板 高效预应力 混凝土双向板
高跃
作品数:246被引量:134H指数:5
供职机构:清华大学
研究主题:超图 点云 图结构 事件流 事件数据
王晓旭
作品数:41被引量:20H指数:3
供职机构:天津工业大学
研究主题:复合材料 预成型体 芯模 纤维束 纤维
梁栋
作品数:2被引量:0H指数:0
供职机构:四川大学高分子科学与工程学院
研究主题:动态保压 无规共聚物 ETHYLENE 性能表征 结构特征