'분류 전체보기' 카테고리의 글 목록 (5 Page)

Notice

Recent Posts

Recent Comments

Link

« 2025/04 »
일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30

Tags more

Archives

관리 메뉴

목록분류 전체보기 (88)

RL Researcher

Lecture 2 : Convex Optimization (Stanford)

Convex sets affine and convex sets some important examples operations that preserve convexity generalized inequalities separating and supporting hyperplanes dual cones and generalized inequalities Affine set 아래에 그림의 $x_{1}$과 $x_{2}$를 통하는 모든 점들을 Affine set이라고 부름. $$x = \theta x_{1}+(1-\theta)x_{2}\ \ \ \ \ (\theta \in R)$$ affine set : set에서 두 개의 다른 점을 통과하는 선을 포함함. example : linear equations $\le..

Optimization/Stanford Lecture 2021. 3. 14. 20:41

Lecture 1-Convex Optimization (Stanford) Question

Question 1 The symbol x⋆ usually denotes a feasible point the optimal value of the problem a solution Question 2 Least squares is a special case of convex optimization. true false Question 3 Almost any problem you'd like to solve in practice is convex. true false Question 4 Convex optimization problems are attractive because they always have a unique solution. true false Question 5 In device siz..

Optimization/Stanford Lecture 2021. 3. 12. 22:26

Convex optimization

1.1 Mathematical optimization 수학적 최적화 문제, 또는 그냥 최적화 문제는 다음과 같은 형식을 가집니다. vector $x=(x_{1},...,x_{n})$ called optimization variable(최적화 변수). function $f_0 : R^{n}\rightarrow R$ called objective function(목적함수). function $f_{i} : R^{n}\rightarrow R, \ \ i=1,...,m$ called (inequality)constraint functions(부등식)제약함수. constant $b_{1},...,b_{m}$ constraints에 대한 limits or bounds. vector $x^{*}$은 called so..

Optimization/Convex Optimization 2021. 3. 10. 10:42

강화학습 공부 자료

강화학습에 대한 공부를 하던 중 다 공부하던 것들은 재생목록에서 지우다 보니 기록을 남겨야겠다 싶어서 작성합니다. 강의 모두를 위한 RL - 김성훈 교수님의 강의 팡요랩 강의 혁펜하임 강의 (수학적 정의를 보고싶으신 분들은 여기를 참조 부탁드립니다.) Deep Mind Reinforcement Learning Lecture CS234 : Reinforcement Learning SNU Reinforcement Learning Lecture Deep RL BootCamp 2017 Deep Reinforcement Learning : CS 285 Fall 2020 POSTECH Reinforcement Learning

Reinfrocement Learning 2021. 2. 27. 20:05

Prev 1 2 3 4 5 6 7 8 ··· 22 Next

목록분류 전체보기 (88)

RL Researcher

티스토리툴바