殷萇茗教授
發(fā)布時間: 2020-03-16 10:30:10 瀏覽量:
長沙理工大學(xué)計算機與通信工程學(xué)研究生導(dǎo)師基本信息表 |
||||
1、個人基本信息: |
||||
姓 名:殷萇茗 |
性 別:男 |
|
||
出生年月:1964年5月 |
技術(shù)職稱:教授 |
|||
畢業(yè)院校:上海大學(xué) |
學(xué)歷(學(xué)位):博士 |
|||
所在學(xué)科:運籌學(xué)與控制論專業(yè) |
研究方向:算法與計算機軟件;機器學(xué)習(xí)與智能控制 |
|||
2、教育背景: |
||||
北京師范大學(xué) |
學(xué)士 |
|||
1998 |
國防科技大學(xué) |
碩士 |
||
2006 |
上海大學(xué) |
博士 |
||
3、目前研究領(lǐng)域: |
||||
算法與計算機軟件;機器學(xué)習(xí)與智能控制 |
||||
4、已完成或已在承擔(dān)的主要課題: |
||||
1. 智能體在部分可觀測馬爾可夫環(huán)境下的激勵學(xué)習(xí)研究, 國家自然科學(xué)基金,2002-2005 2. 多時間尺度風(fēng)險敏感度MDP研究,理工大學(xué)科研基金 3. 湖南省青年骨干教師培養(yǎng)對象,湖南省教育廳 4. 1火力發(fā)電廠分布式數(shù)據(jù)采集與故障診斷系統(tǒng),湖南省電力局科研項目(1998年),已結(jié)題 ,6萬元,主持。 5. 智能體在部分可觀測馬爾可夫環(huán)境下的激勵學(xué)習(xí)研究,國家自然科學(xué)基金項目,在 研 ,20萬元,主研。 6. 江西省地區(qū)電網(wǎng)負(fù)荷預(yù)測與分析系統(tǒng),江西省電力總公司, 已結(jié)題,50萬元,主研。 7. 教學(xué)管理軟件的開發(fā)與推廣,長沙電力學(xué)院教研項目 (2000年),已結(jié)題,0.5萬元,主研。 8. 激勵學(xué)習(xí)算法的收斂性研究,湖南省教委科研項目 (2000年),已結(jié)題,0.5萬元,主研。 9. 激勵學(xué)習(xí)智能體最優(yōu)控制策略及其在微經(jīng)濟環(huán)境下的決策問題,湖南省教育廳科研基金項目(2007),在研,1萬元,主持。 10. 7、多時間參數(shù)風(fēng)險敏感度MDP研究,長沙理工大學(xué)科研基金項目(2006),在研,3萬元,主持。 11. |
||||
5、已出版的主要著作: |
||||
|
||||
6、已發(fā)表的學(xué)術(shù)論文: |
||||
1. Optimal Equality for Multi-Time Scale Risk-Sensitive Markov Decision Processes,Proceedings inISCST, 2005,Ningbo, China 2. Automatic Discovery of Subgoals for Sequential Decision Problems Using Potential Fields,Proceedings in ICNC, 2005: 384-391. 3. 求解POMDP的動態(tài)合并激勵學(xué)習(xí)算法,計算機工程, No.19,2005 4. 基于動態(tài)規(guī)劃的激勵學(xué)習(xí)遺忘算法,計算機工程與應(yīng)用, 2004,Vol 40, No.20 5. Reinforcement Learning Forgetting Algorithm Based on Dynamic Programming,Journal of Computer Engineering and Applications, 2004, Vol 40, No.20. 6. Average Asymptotic Temporal Difference Learning Forgetting Algorithm on Eligibility Trace, Journal of Changsha University of Electric Power, 2003 (4). 7. Reinforcement Learning Algorithm for Solving RTDP with Variational Environment. ICGST International Journal on Artificial Intelligence and Machine Learning (AIML), Volume (7), Issue (I), pp17-21. 8. Reinforcement Learning Algorithms Based on mGA and EA with Policy Iterations. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Bio-Inspired Computational Intelligence and Applications - International Conference on Life System Modeling and Simulation, LSMS 2007, Proceedings v 4688 LNCS 2007. 9. Risk-Sensitive Reinforcement Learning Algorithms with Generalized Average Criterion. Applied Mathematics and Mechanics-English Edition, 2007, V28, N3 ( MAR ) , pp405-416. 10. Global Attractor for KGS Lattice System.Applied Mathematics and Mechanics-English Edition, 2007, V28, N5 (MAC), pp619-628. 11. Fused Sarsa(lambda)Learning Algorithm Based-on Multi-agent. Journal of Computer Engineering and Applications,2008, 44 (4), pp182-183. 12. Automatic Discovery of Subgoals for Sequential Decision Problems Using Potential Fields. 2005 International Conference on Natural Computation/2005 International Conference on Fuzzy Systems and knowledge Discovery (ICNC'05-FSKD'05), IEEE. 27-29 August 2005, Changsha , China . (Lecture Notes in Computer Science, v 3612, n PART III, Advances in Natural Computation: First International Conference, ICNC 2005. Proceedings, 2005, pp384-391) 13. Optimal Equality for Multi-Time Scale Risk-Sensitive Markov Decision Processes. Proceedings in the International Symposium on Computer Science and Technology 2005, Ningbo , China . 14. Reinforcement Learning Algorithm Based-on Policy Iteration for Solving RTDP. 2006.8, ISAI’2006, Beijing , China . 15. U-Clustering: A Reinforcement Learning Algorithm Based on Utility Clustering. Journal of Computer Engineering and Applications, 2005, No.20. 16. Reinforcement Learning Forgetting Algorithm Based on Dynamic Programming. Journal of Computer Engineering and Applications, 2004, No.20. 17. The Dynamic Merge Reinforcement Learning Algorithm for Solving POMDP. Journal of Computer Engineering. 2005, 11. 18. Multi-Time Scale Risk-Sensitive Hierarchical Structure Control Problem. DCABES2006, Hangzhou , China , 2006.10. 19. Utility Clustering for Reinforcement Learning with Partial Observability. In Proceedings of Conference of Chinese Intelligence Automatization, HongKong , China , 2003.(IJCAI03). 20. Average Asymptotic Temporal Difference Learning Forgetting Algorithm on Eligibility Trace, Journal of Changsha University of Electric Power, 2003 (4). 21. Nonlinear Control Based on Q-learning Algorithms. Journal of Changsha University of Electric Power, Val.18, No.1, 2003 (1). 22. A Relative Value Iteration Q-Learning Algorithm and Its Convergence Based-on Finite Samples. Journal of Computer Research and Development. Sept.2002, Vol.39, No.9. 23. Optimality Cost Relative Value Iteration Q-Learning Algorithm Based on Finite Samples. Journal of Computer Engineering and Applications, 2002, No.14. 24. Generalize Average Algorithm for Reinforcement Learning Its Convergence. Journal of Computer Engineering and Applications, 2002, No.20. 25. Reinforcement Learning Algorithm Based on average Cost Optimization for Each Stage. Journal of Computer Applications, Val.22, No.4, 2002 (4). 26. Classification for Un-labeled Context Based on Maximum Expectation Learning Algorithm. Proceedings of 14th CDC (Annul Conference of Control and Decision, China ). 27. ATD(lambda) Learning Forgetting Algorithm. Proceedings of 4th Machine and Electric Engineering Association of Hunan , China , Aug. 2002. 28. Distributed Real-time System for Electric Power Enterprise Based on Intranet/Web. Journal of Applications of the Computer Systems, 2002(4). 29. The Uniform of Security Policy in Distributed System. Journal of Information Engineering University , 2001. (Proceedings of Annual Conference of Chinese Networks and Information Security, Zhengzhou , China , 2001). 30. Design of Distributed Real Time Database System Based on JDBC/Web. Journal of Computer Development and Applications. 2001,No.36. 31. The Application Delphi Multi-thread for Distributed Real time Multi-task System. Journal of Changsha University of Electric Power, Val.15, No.1, 2001 (1). 32. Comparing ARP of IPv4 with Neighbor Discovery Protocol of IPv6. Journal of Changsha University of Electric Power, Val.16, No.1, 2001 (1). 33. Study and Application of Distributed Real Time Multimedia Database. Journal of Changsha University of Electric Power, Val.16, No.2, 2001 (2). 34. The Design of Real-time Monitor Database System Based on Distributed Heterogeneous Networks Environment. Journal of Changsha University of Electric Power, Val.16, No.3, 2001 (3). 35. Distributed Real-time Multi-task System Study and Application for Monitoring and Supervising in Electric Power Plant. Proceedings of 1st Machine and Electric Engineering Association of Hunan , China , Aug, 1999. 36. The Principles and Design Methods for Domain Service System of Campus Networks. Journal of Changsha University of Electric Power, Val.13, No.1, 1998(1). 37. Security Study for Windows NT Network Management. Journal of Changsha University of Electric Power, Val.13, No.2, 1998(2). 38. The Weighed Lorentz Norm Inequality of Generalization Maximum Operator. Annual of Hunan Mathematics, Val 17, No.2, 1997. 39. The Weighted boundary of Operator and its interpolation on Mixed Lebesgue Space. Journal of Changsha University of Electric Power, Val.12, No.3, 1997 (3). 40. The Alternativeness of Non-Commutative and Non-Combinative Fractional Ring. Journal of Changsha University of Water Resources and Electric Power, Val.8, No.2, 1993 (2). 41. The Combiner Theory of Non-Commutative and Non-Combinative Fractional Ring. Journal of Changsha University of Water Resources and Electric Power, Val.6, No.2, 1991 (2). 42. The Equivalence Conditions for Reductionable Elements on Complex Commutative Banach Algebra. Journal of Changsha University of Water Resources and Electric Power, Val.5, No.1, 1990 (1). 43. F-Set on Unit square-cube under n-Dimension Euclid Space. Journal of Changsha University of Water Resources and Electric Power, Val.5, No.2, 1990 (2). |
||||
7、所獲學(xué)術(shù)榮譽及學(xué)術(shù)影響: |
||||
1. 1998年度 獲長沙電力學(xué)院“優(yōu)秀教師” 2. 1998年度 獲系“優(yōu)秀畢業(yè)實習(xí)指導(dǎo)教師” 3. 2000年度 獲長沙電力學(xué)院“優(yōu)秀教師” 4. 2000年度 獲長沙電力學(xué)院“優(yōu)質(zhì)課獎” 5. 2001年度 獲長沙電力學(xué)院“優(yōu)秀教師” 6. 2002年度 獲長沙電力學(xué)院“優(yōu)秀教師” 7. 2002年度 獲“華中電力集團獎教基金獎”三等獎 8. 2003年度 湖南省高等學(xué)校青年骨干教師培養(yǎng)對象
|
||||
上一篇:蔣加伏教授
下一篇:徐學(xué)軍教授