殷萇茗-計算機與通信工程學院

教授

當前位置: 首頁 > 師資隊伍 > 大數(shù)據(jù)系 > 教授 > 正文

殷萇茗

發(fā)布時間: 2020-03-16 10:30:10 瀏覽量:

長沙理工大學計算機與通信工程學研究生導師基本信息表
1、個人基本信息：
姓名：殷萇茗		性別：男
出生年月：1964年5月		技術職稱：教授
畢業(yè)院校：上海大學		學歷（學位）：博士
所在學科：運籌學與控制論專業(yè)		研究方向：算法與計算機軟件；機器學習與智能控制
2、教育背景：
	北京師范大學		學士
1998	國防科技大學		碩士
2006	上海大學		博士

3、目前研究領域：
算法與計算機軟件；機器學習與智能控制
4、已完成或已在承擔的主要課題：
1. 智能體在部分可觀測馬爾可夫環(huán)境下的激勵學習研究，國家自然科學基金,2002-2005 2. 多時間尺度風險敏感度MDP研究，理工大學科研基金 3. 湖南省青年骨干教師培養(yǎng)對象，湖南省教育廳 4. 1火力發(fā)電廠分布式數(shù)據(jù)采集與故障診斷系統(tǒng)，湖南省電力局科研項目（1998年），已結題，6萬元，主持。 5. 智能體在部分可觀測馬爾可夫環(huán)境下的激勵學習研究，國家自然科學基金項目，在研，20萬元，主研。 6. 江西省地區(qū)電網(wǎng)負荷預測與分析系統(tǒng)，江西省電力總公司，已結題，50萬元，主研。 7. 教學管理軟件的開發(fā)與推廣，長沙電力學院教研項目（2000年），已結題，0.5萬元，主研。 8. 激勵學習算法的收斂性研究，湖南省教委科研項目（2000年），已結題，0.5萬元，主研。 9. 激勵學習智能體最優(yōu)控制策略及其在微經(jīng)濟環(huán)境下的決策問題，湖南省教育廳科研基金項目（2007），在研，1萬元，主持。 10. 7、多時間參數(shù)風險敏感度MDP研究，長沙理工大學科研基金項目（2006），在研，3萬元，主持。 11.
5、已出版的主要著作：

6、已發(fā)表的學術論文：
1. Optimal Equality for Multi-Time Scale Risk-Sensitive Markov Decision Processes，Proceedings inISCST, 2005,Ningbo, China 2. Automatic Discovery of Subgoals for Sequential Decision Problems Using Potential Fields，Proceedings in ICNC, 2005: 384-391. 3. 求解POMDP的動態(tài)合并激勵學習算法，計算機工程, No.19,2005 4. 基于動態(tài)規(guī)劃的激勵學習遺忘算法，計算機工程與應用, 2004,Vol 40, No.20 5. Reinforcement Learning Forgetting Algorithm Based on Dynamic Programming，Journal of Computer Engineering and Applications, 2004, Vol 40, No.20. 6. Average Asymptotic Temporal Difference Learning Forgetting Algorithm on Eligibility Trace, Journal of Changsha University of Electric Power, 2003 (4). 7. Reinforcement Learning Algorithm for Solving RTDP with Variational Environment. ICGST International Journal on Artificial Intelligence and Machine Learning (AIML), Volume (7), Issue (I), pp17-21. 8. Reinforcement Learning Algorithms Based on mGA and EA with Policy Iterations. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Bio-Inspired Computational Intelligence and Applications - International Conference on Life System Modeling and Simulation, LSMS 2007, Proceedings v 4688 LNCS 2007. 9. Risk-Sensitive Reinforcement Learning Algorithms with Generalized Average Criterion. Applied Mathematics and Mechanics-English Edition, 2007, V28, N3 ( MAR ) , pp405-416. 10. Global Attractor for KGS Lattice System.Applied Mathematics and Mechanics-English Edition, 2007, V28, N5 (MAC), pp619-628. 11. Fused Sarsa（lambda）Learning Algorithm Based-on Multi-agent. Journal of Computer Engineering and Applications,2008, 44 (4), pp182-183. 12. Automatic Discovery of Subgoals for Sequential Decision Problems Using Potential Fields. 2005 International Conference on Natural Computation/2005 International Conference on Fuzzy Systems and knowledge Discovery (ICNC'05-FSKD'05), IEEE. 27-29 August 2005, Changsha , China . (Lecture Notes in Computer Science, v 3612, n PART III, Advances in Natural Computation: First International Conference, ICNC 2005. Proceedings, 2005, pp384-391) 13. Optimal Equality for Multi-Time Scale Risk-Sensitive Markov Decision Processes. Proceedings in the International Symposium on Computer Science and Technology 2005, Ningbo , China . 14. Reinforcement Learning Algorithm Based-on Policy Iteration for Solving RTDP. 2006.8, ISAI’2006, Beijing , China . 15. U-Clustering: A Reinforcement Learning Algorithm Based on Utility Clustering. Journal of Computer Engineering and Applications, 2005, No.20. 16. Reinforcement Learning Forgetting Algorithm Based on Dynamic Programming. Journal of Computer Engineering and Applications, 2004, No.20. 17. The Dynamic Merge Reinforcement Learning Algorithm for Solving POMDP. Journal of Computer Engineering. 2005, 11. 18. Multi-Time Scale Risk-Sensitive Hierarchical Structure Control Problem. DCABES2006, Hangzhou , China , 2006.10. 19. Utility Clustering for Reinforcement Learning with Partial Observability. In Proceedings of Conference of Chinese Intelligence Automatization, HongKong , China , 2003.(IJCAI03). 20. Average Asymptotic Temporal Difference Learning Forgetting Algorithm on Eligibility Trace, Journal of Changsha University of Electric Power, 2003 (4). 21. Nonlinear Control Based on Q-learning Algorithms. Journal of Changsha University of Electric Power, Val.18, No.1, 2003 (1). 22. A Relative Value Iteration Q-Learning Algorithm and Its Convergence Based-on Finite Samples. Journal of Computer Research and Development. Sept.2002, Vol.39, No.9. 23. Optimality Cost Relative Value Iteration Q-Learning Algorithm Based on Finite Samples. Journal of Computer Engineering and Applications, 2002, No.14. 24. Generalize Average Algorithm for Reinforcement Learning Its Convergence. Journal of Computer Engineering and Applications, 2002, No.20. 25. Reinforcement Learning Algorithm Based on average Cost Optimization for Each Stage. Journal of Computer Applications, Val.22, No.4, 2002 (4). 26. Classification for Un-labeled Context Based on Maximum Expectation Learning Algorithm. Proceedings of 14th CDC (Annul Conference of Control and Decision, China ). 27. ATD(lambda) Learning Forgetting Algorithm. Proceedings of 4th Machine and Electric Engineering Association of Hunan , China , Aug. 2002. 28. Distributed Real-time System for Electric Power Enterprise Based on Intranet/Web. Journal of Applications of the Computer Systems, 2002(4). 29. The Uniform of Security Policy in Distributed System. Journal of Information Engineering University , 2001. (Proceedings of Annual Conference of Chinese Networks and Information Security, Zhengzhou , China , 2001). 30. Design of Distributed Real Time Database System Based on JDBC/Web. Journal of Computer Development and Applications. 2001,No.36. 31. The Application Delphi Multi-thread for Distributed Real time Multi-task System. Journal of Changsha University of Electric Power, Val.15, No.1, 2001 (1). 32. Comparing ARP of IPv4 with Neighbor Discovery Protocol of IPv6. Journal of Changsha University of Electric Power, Val.16, No.1, 2001 (1). 33. Study and Application of Distributed Real Time Multimedia Database. Journal of Changsha University of Electric Power, Val.16, No.2, 2001 (2). 34. The Design of Real-time Monitor Database System Based on Distributed Heterogeneous Networks Environment. Journal of Changsha University of Electric Power, Val.16, No.3, 2001 (3). 35. Distributed Real-time Multi-task System Study and Application for Monitoring and Supervising in Electric Power Plant. Proceedings of 1st Machine and Electric Engineering Association of Hunan , China , Aug, 1999. 36. The Principles and Design Methods for Domain Service System of Campus Networks. Journal of Changsha University of Electric Power, Val.13, No.1, 1998(1). 37. Security Study for Windows NT Network Management. Journal of Changsha University of Electric Power, Val.13, No.2, 1998(2). 38. The Weighed Lorentz Norm Inequality of Generalization Maximum Operator. Annual of Hunan Mathematics, Val 17, No.2, 1997. 39. The Weighted boundary of Operator and its interpolation on Mixed Lebesgue Space. Journal of Changsha University of Electric Power, Val.12, No.3, 1997 (3). 40. The Alternativeness of Non-Commutative and Non-Combinative Fractional Ring. Journal of Changsha University of Water Resources and Electric Power, Val.8, No.2, 1993 (2). 41. The Combiner Theory of Non-Commutative and Non-Combinative Fractional Ring. Journal of Changsha University of Water Resources and Electric Power, Val.6, No.2, 1991 (2). 42. The Equivalence Conditions for Reductionable Elements on Complex Commutative Banach Algebra. Journal of Changsha University of Water Resources and Electric Power, Val.5, No.1, 1990 (1). 43. F-Set on Unit square-cube under n-Dimension Euclid Space. Journal of Changsha University of Water Resources and Electric Power, Val.5, No.2, 1990 (2).
7、所獲學術榮譽及學術影響：
1. 1998年度獲長沙電力學院“優(yōu)秀教師” 2. 1998年度獲系“優(yōu)秀畢業(yè)實習指導教師” 3. 2000年度獲長沙電力學院“優(yōu)秀教師” 4. 2000年度獲長沙電力學院“優(yōu)質(zhì)課獎” 5. 2001年度獲長沙電力學院“優(yōu)秀教師” 6. 2002年度獲長沙電力學院“優(yōu)秀教師” 7. 2002年度獲“華中電力集團獎教基金獎”三等獎 8. 2003年度湖南省高等學校青年骨干教師培養(yǎng)對象

上一篇：蔣加伏教授

下一篇：徐學軍教授