Notice
Recent Posts
Recent Comments
Link
일 | 월 | 화 | 수 | 목 | 금 | 토 |
---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | |||
5 | 6 | 7 | 8 | 9 | 10 | 11 |
12 | 13 | 14 | 15 | 16 | 17 | 18 |
19 | 20 | 21 | 22 | 23 | 24 | 25 |
26 | 27 | 28 | 29 | 30 | 31 |
Tags
- Linear algebra
- 강화학습
- 유니티
- neural network
- David Silver
- ML-Agent
- Hessian Matrix
- optimization
- Python Programming
- 데이터 분석
- list
- Series
- Jacobian Matrix
- 논문
- paper
- 모두를 위한 RL
- 딥러닝
- 판다스
- Laplacian
- machine learning
- reinforcement learning
- 사이킷런
- rl
- Deep Learning
- convex optimization
- statistics
- 리스트
- 김성훈 교수님
- pandas
- unity
Archives
RL Researcher
Tags
- reinforcement learning
- 강화학습
- Python Programming
- pandas
- machine learning
- David Silver
- 판다스
- 데이터 분석
- 모두를 위한 RL
- Deep Learning
- list
- convex optimization
- 리스트
- 김성훈 교수님
- ML-Agent
- 사이킷런
- 딥러닝
- Linear algebra
- 유니티
- rl
- statistics
- neural network
- unity
- optimization
- 논문
- Series
- paper
- Laplacian
- Hessian Matrix
- Jacobian Matrix
- Differentiability
- Row Exchanges
- Triangle Factors
- Bounded
- Inverse matrix
- hyperplane
- Machine Learning Statistics
- Basic Statistics
- independent and identically distributed sample
- Law of Large Number
- line segment
- 볼록 최적화
- 강화학습 자료
- Mathmatics
- 다중선택문제
- Actor-Critic
- Policy Gradient
- Deep Mind
- Playing Atari with Deep Reinforcement Learning
- Atari Game
- Human-level control through deep reinforcement learning
- Unity Ml
- Deep-Q-Network
- Q-Network
- nondeterministic
- Windy Frozen Lake
- Q-Learning Table
- Q-Learning Algorithm
- Dummy Q-Learning
- 모두를 위한 RL 강좌
- Bellman Expectation Equation
- 근사 가치함수
- Value Function Approximation
- Model-Free
- Model-Free Control
- Model-Free Prediction
- Planning by Dynamic Programming
- Lecture 2
- MinMaxScaler
- StandardScaler
- 원-핫 인코딩
- OneHotEncoder
- 레이블 인코딩
- LabelEncoder
- Cross-validation
- K-Fold Cross-validation
- in키워드
- 판다서
- tail()
- head()
- K-Nearest-Neighbor
- 선형회귀분석
- Ensemble Learning
- 붓꽃 데이터
- train_test_split
- Mean Absolute
- 평균 제곱근 오차
- sigmoid함수
- Mean Square Error
- 평균제곱오차
- 대수의 법칙
- stacking
- k-armed bandit
- multiclass classification
- bagging
- 관계 연산자
- loss function
- decisiontree
- eigenvector
- eigenvalue
- 피처 스케일링
- 과적합
- binary classification
- Markov Decision Process
- Markov Decision Processes
- classifier
- enumerate
- 초평면
- Support vector machine
- preprocessing
- overfitting
- sklearn
- 비지도학습
- tuple
- randomForest
- reinforcement
- 베이즈 정리
- 동적 계획법
- 의사결정나무
- Jupyter Notebook
- scikit-learn
- 베이즈
- 지도학습
- Unsupervised Learning
- Supervised Learning
- 역행렬
- 머신러닝
- 교차검증
- boosting
- Statistic
- Stochastic
- 고유벡터
- 고유값
- 확률론
- linear regression
- MAb
- 가중치
- 결정계수
- 기계학습
- DQN
- 전처리
- while문
- 선형대수
- SST
- mdp
- boolean
- MSE
- scalar
- SVM
- RMSE
- IID
- classification
- float
- 선분
- for문
- 문자열
- 튜플
- SSR
- SSE
- bias
- Sequence
- limits
- 앙상블
- sum
- MRP
- Module
- 선형대수학
- 모듈
- 딕셔너리
- 반복문
- 조건문
- Gradient
- regression
- mp
- Artificial Intelligence
- iNT
- KNN
- 편향
- numpy
- function
- String
- 집합
- None
- Computer Science
- DP
- nn
- set
- iris
- 인공지능
- Mae
- dl
- product
- 함수
- 수열
- 최적화
- R2
- If
- 표준화
- WEIGHT
- Dictionary