60 #Interview #Questions On #MachineLearning https://analyticsindiamag.com/60-in

Data Science 2020. 2. 10. 17:15

https://analyticsindiamag.com/60-interview-questions-on-machine-learning/

'Data Science' 카테고리의 다른 글

I open-sourced a comprehensive guide to prepare for DataScience/AI interviews. T (0)	2020.08.07
Hi DataScience enthusiast . Are you fresher or professional looking out to make your path as "Data Scientist" and here something for you for upcoming 30days(prepare yourself and get hired ) . DataScience interview questions #day25 . If you missed #day1 .. (0)	2019.11.26
Towards Data Science(TDS)에서 지금까지 올라온 주옥같은 포스트들을 주제별로 분류하여 제공하였습니다. https://towardsdatascience.com/learn-on-towards-data-science-52245bc91451 (0)	2019.11.26
Hi DataScience enthusiast . Are you fresher or professional looking out to make your path as "Data Scientist" and here something for you for upcoming 30days(prepare yourself and get hired ) . DataScience interview questions #day13 . If you missed #day1 .. (0)	2019.11.15
python for data science cheat sheet using pandas (0)	2019.11.11

Posted by uniqueone

,

Machine Learning From Scratch GitHub, by Erik Linder-Norén : https://github.com

Deep Learning/course 2020. 2. 10. 15:55

Machine Learning From Scratch

GitHub, by Erik Linder-Norén : https://github.com/eriklindernoren/ML-From-Scratch

#ArtificialIntelligence #DeepLearning #MachineLearning

Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.

Machine Learning From Scratch

About

Python implementations of some of the fundamental Machine Learning models and algorithms from scratch.

The purpose of this project is not to produce as optimized and computationally efficient algorithms as possible but rather to present the inner workings of them in a transparent and accessible way.

Installation

$ git clone https://github.com/eriklindernoren/ML-From-Scratch $ cd ML-From-Scratch $ python setup.py install

Examples

Polynomial Regression

$ python mlfromscratch/examples/polynomial_regression.py

Figure: Training progress of a regularized polynomial regression model fitting
temperature data measured in Linköping, Sweden 2016.

Classification With CNN

$ python mlfromscratch/examples/convolutional_neural_network.py +---------+ | ConvNet | +---------+ Input Shape: (1, 8, 8) +----------------------+------------+--------------+ | Layer Type | Parameters | Output Shape | +----------------------+------------+--------------+ | Conv2D | 160 | (16, 8, 8) | | Activation (ReLU) | 0 | (16, 8, 8) | | Dropout | 0 | (16, 8, 8) | | BatchNormalization | 2048 | (16, 8, 8) | | Conv2D | 4640 | (32, 8, 8) | | Activation (ReLU) | 0 | (32, 8, 8) | | Dropout | 0 | (32, 8, 8) | | BatchNormalization | 4096 | (32, 8, 8) | | Flatten | 0 | (2048,) | | Dense | 524544 | (256,) | | Activation (ReLU) | 0 | (256,) | | Dropout | 0 | (256,) | | BatchNormalization | 512 | (256,) | | Dense | 2570 | (10,) | | Activation (Softmax) | 0 | (10,) | +----------------------+------------+--------------+ Total Parameters: 538570 Training: 100% [------------------------------------------------------------------------] Time: 0:01:55 Accuracy: 0.987465181058

Figure: Classification of the digit dataset using CNN.

Density-Based Clustering

$ python mlfromscratch/examples/dbscan.py

Figure: Clustering of the moons dataset using DBSCAN.

Generating Handwritten Digits

$ python mlfromscratch/unsupervised_learning/generative_adversarial_network.py +-----------+ | Generator | +-----------+ Input Shape: (100,) +------------------------+------------+--------------+ | Layer Type | Parameters | Output Shape | +------------------------+------------+--------------+ | Dense | 25856 | (256,) | | Activation (LeakyReLU) | 0 | (256,) | | BatchNormalization | 512 | (256,) | | Dense | 131584 | (512,) | | Activation (LeakyReLU) | 0 | (512,) | | BatchNormalization | 1024 | (512,) | | Dense | 525312 | (1024,) | | Activation (LeakyReLU) | 0 | (1024,) | | BatchNormalization | 2048 | (1024,) | | Dense | 803600 | (784,) | | Activation (TanH) | 0 | (784,) | +------------------------+------------+--------------+ Total Parameters: 1489936 +---------------+ | Discriminator | +---------------+ Input Shape: (784,) +------------------------+------------+--------------+ | Layer Type | Parameters | Output Shape | +------------------------+------------+--------------+ | Dense | 401920 | (512,) | | Activation (LeakyReLU) | 0 | (512,) | | Dropout | 0 | (512,) | | Dense | 131328 | (256,) | | Activation (LeakyReLU) | 0 | (256,) | | Dropout | 0 | (256,) | | Dense | 514 | (2,) | | Activation (Softmax) | 0 | (2,) | +------------------------+------------+--------------+ Total Parameters: 533762

Figure: Training progress of a Generative Adversarial Network generating
handwritten digits.

Deep Reinforcement Learning

$ python mlfromscratch/examples/deep_q_network.py +----------------+ | Deep Q-Network | +----------------+ Input Shape: (4,) +-------------------+------------+--------------+ | Layer Type | Parameters | Output Shape | +-------------------+------------+--------------+ | Dense | 320 | (64,) | | Activation (ReLU) | 0 | (64,) | | Dense | 130 | (2,) | +-------------------+------------+--------------+ Total Parameters: 450

Figure: Deep Q-Network solution to the CartPole-v1 environment in OpenAI gym.

Image Reconstruction With RBM

$ python mlfromscratch/examples/restricted_boltzmann_machine.py

Figure: Shows how the network gets better during training at reconstructing
the digit 2 in the MNIST dataset.

Evolutionary Evolved Neural Network

$ python mlfromscratch/examples/neuroevolution.py +---------------+ | Model Summary | +---------------+ Input Shape: (64,) +----------------------+------------+--------------+ | Layer Type | Parameters | Output Shape | +----------------------+------------+--------------+ | Dense | 1040 | (16,) | | Activation (ReLU) | 0 | (16,) | | Dense | 170 | (10,) | | Activation (Softmax) | 0 | (10,) | +----------------------+------------+--------------+ Total Parameters: 1210 Population Size: 100 Generations: 3000 Mutation Rate: 0.01 [0 Best Individual - Fitness: 3.08301, Accuracy: 10.5%] [1 Best Individual - Fitness: 3.08746, Accuracy: 12.0%] ... [2999 Best Individual - Fitness: 94.08513, Accuracy: 98.5%] Test set accuracy: 96.7%

Figure: Classification of the digit dataset by a neural network which has
been evolutionary evolved.

Genetic Algorithm

$ python mlfromscratch/examples/genetic_algorithm.py +--------+ | GA | +--------+ Description: Implementation of a Genetic Algorithm which aims to produce the user specified target string. This implementation calculates each candidate's fitness based on the alphabetical distance between the candidate and the target. A candidate is selected as a parent with probabilities proportional to the candidate's fitness. Reproduction is implemented as a single-point crossover between pairs of parents. Mutation is done by randomly assigning new characters with uniform probability. Parameters ---------- Target String: 'Genetic Algorithm' Population Size: 100 Mutation Rate: 0.05 [0 Closest Candidate: 'CJqlJguPlqzvpoJmb', Fitness: 0.00] [1 Closest Candidate: 'MCxZxdr nlfiwwGEk', Fitness: 0.01] [2 Closest Candidate: 'MCxZxdm nlfiwwGcx', Fitness: 0.01] [3 Closest Candidate: 'SmdsAklMHn kBIwKn', Fitness: 0.01] [4 Closest Candidate: ' lotneaJOasWfu Z', Fitness: 0.01] ... [292 Closest Candidate: 'GeneticaAlgorithm', Fitness: 1.00] [293 Closest Candidate: 'GeneticaAlgorithm', Fitness: 1.00] [294 Answer: 'Genetic Algorithm']

Association Analysis

$ python mlfromscratch/examples/apriori.py +-------------+ | Apriori | +-------------+ Minimum Support: 0.25 Minimum Confidence: 0.8 Transactions: [1, 2, 3, 4] [1, 2, 4] [1, 2] [2, 3, 4] [2, 3] [3, 4] [2, 4] Frequent Itemsets: [1, 2, 3, 4, [1, 2], [1, 4], [2, 3], [2, 4], [3, 4], [1, 2, 4], [2, 3, 4]] Rules: 1 -> 2 (support: 0.43, confidence: 1.0) 4 -> 2 (support: 0.57, confidence: 0.8) [1, 4] -> 2 (support: 0.29, confidence: 1.0)

Implementations

Supervised Learning

Unsupervised Learning

Reinforcement Learning

Deep Q-Network

Deep Learning

Neural Network
Layers
- Activation Layer
- Average Pooling Layer
- Batch Normalization Layer
- Constant Padding Layer
- Convolutional Layer
- Dropout Layer
- Flatten Layer
- Fully-Connected (Dense) Layer
- Fully-Connected RNN Layer
- Max Pooling Layer
- Reshape Layer
- Up Sampling Layer
- Zero Padding Layer
Model Types

Contact

If there's some implementation you would like to see here or if you're just feeling social, feel free to email me or connect with me on LinkedIn.

'Deep Learning > course' 카테고리의 다른 글

We decided to make most of our Deep Learning Teaching Materials freely available (0)	2020.03.27
CS330 전과정 강의비디오가 유튜브에 업로드되었습니다. https://www.youtube.com/playlist?list=PLoROMvod (0)	2020.02.26
매우 좋은 AI 학습사이트! 무료에 1:1 튜터 형식으로 재미도 있고! 비록 영어 동영상이지만, 영어 자막 제공하고 있어서 이해하기도 쉽고 (0)	2020.02.01
YouTube에서 '머신러닝/딥러닝 실전 입문' 보기 윤인성씨 강의 (0)	2020.01.07
This course uses pytorch instead of tensorflow.[ [](https://t.co/lahLZspADM?amp (0)	2019.12.31

Posted by uniqueone

,

안녕하세요 Lidar SLAM공부중인 김기섭 입니다 # 지난주에 SLAM덩크 스터디에서 이종훈 님께서 레인지넷에 대해 소개해주셨는데요, 1

SLAM/resources 2020. 2. 10. 08:06

안녕하세요 Lidar SLAM공부중인 김기섭 입니다

#
지난주에 SLAM덩크 스터디에서 이종훈 님께서 레인지넷에 대해 소개해주셨는데요,
19 ICCV에 소개된 시맨틱 키티라는 point-wise 로 fully labeled 된 dataset이 있었기에 가능했던 연구였습니다.

#
시맨틱키티는
28가지 종류의 라벨 (도로, 폴, 인도, 자동차, 사람 등) 및
movable 한 object의 경우 (자동차, 사람 등) instance ID 또한 제공합니다.
이 정보를 이용하면 기존 segmentation work 뿐 아니라 object tracking 또는 dynamic/static 판별 등의 연구도 진행할 수 있을 것 같네요.
예시 영상입니다 https://youtu.be/gNeEfPEyHuw

#
저자가 공개한 파이썬API도 공개되어 있습니다.
https://github.com/PRBonn/semantic-kitti-api
저는 개인적으로 매트랩을 선호해서 매트랩 API도 간단히 만들어보았습니다.
https://github.com/kissb2/semantickitti-matlab

#
시맨틱 라이다 슬램 연구 관심 많으신분들 함께해요
감사합니다.

'SLAM > resources' 카테고리의 다른 글

1. SLAM이란? https://cv-learn.com/SLAM-f16a75f894ca48d3aa851bca99ec7cce 2. SLAM을 (0)	2020.02.23
안녕하세요 SLAM공부 김기섭입니다. # 지난번에 KITTI dataset 을 열어보면서 SLAM 공부 시작해보면 좋다고 전해드렸었는데요 (0)	2020.02.19
SLAM이 처음이신분들은 SLAM의 수학적 원리 이해도 좋지만 다양한 데이터셋을 받고, 열어보고, 만져보는거부터 출발하셔도 좋을거같아요 (0)	2020.01.30
안녕하세요 SLAM공부 김기섭입니다. 논문이 너무 많은 시대입니다. 최신 논문 중 뭘 읽어야 할지 따라가기가 늘 힘들었는데 아래 슬라이 (0)	2020.01.28
Visual Odometry 방법 중 direct method를 사용한 Direct Sparse Odometry(DSO) 논문의 내용 일부분을 (0)	2020.01.06

Posted by uniqueone

,

안녕하세요! 최근 ImageNet 에 SOTA 를 찍은 논문입니다. https://arxiv.org/abs/1911.04252 캐글에서 많

Deep Learning/Kaggle 2020. 2. 10. 08:03

안녕하세요!

최근 ImageNet 에 SOTA 를 찍은 논문입니다.

https://arxiv.org/abs/1911.04252

캐글에서 많이 보던 pseudo-labeling 이 들어가있네요!

대략 정리하면

1. Train the teacher model with labeled data

2. Generate pseudo label for unlabeled data with teacher model

3. Train the student model with both labeled data and unlabeled data

4. Generate pseudo label for unlabeled data with teature model (student model in step 3)

5. Train the student model with both labeled data and unlabeled data

위 과정을 계속 반복합니다 ㅎ

성능이 매우 좋아지네요!

캐글에서 메달 따려면, SOTA 논문을 잘 보고, 그대로 쓰면 되겠죠?

'Deep Learning > Kaggle' 카테고리의 다른 글

안녕하세요 Bengali.AI 대회에 대해 영상을 3개 더 찍었네요 Multi label 일 때, stratified folding 을 할 (0)	2020.03.04
👨‍🏫 안녕하세요 데이콘의 경진대회 수상작들 중에 PDF로 만들어진 내용을 슬라이드 쉐어에 모아 봤습니다. 🥳특히 시각화 대회 수상작들을 (0)	2020.02.28
KB금융그룹 문자 분석 경진대회의 수상자가 가려졌습니다. 수고하셨습니다. 코로나 때문에 아쉽게 시상식은 열리지 못했지만 데이콘에서 2주 후 (0)	2020.02.07
안녕하세요! deepfake 대회가 진행되고 있지만, 쉽지 않은 대회인 것 같습니다. 저어어엉~말 간단하게 프로세스를 정리해봤습니다. (0)	2020.02.06
안녕하세요, 이정윤입니다. 오늘은 지난 시간에 이어 [#Kaggle](https://www.facebook.com/hashtag/kaggle? (0)	2020.02.03

Posted by uniqueone

,

Pytorch로 image classification 작업하는 분들을 위해 좋은 github을 소개합니다. https://github.com/

Deep Learning/PyTorch 2020. 2. 10. 08:02

https://www.facebook.com/groups/PyTorchKR/permalink/1620762534730088/

Pytorch로 image classification 작업하는 분들을 위해 좋은 github을 소개합니다.

https://github.com/rwightman/pytorch-image-models

(pip install timm이나 git clone https://github.com/rwightman/pytorch-image-models 등으로 사용하실 수 있습니다.)

이전에도 efficientnet code에 대해서 소개해드린 코드베이스입니다.

EfficientNet, ResNet, HRNet, SelecSLS등 다양한 모델들과 pretrained weight들을 갖추고 있으며, 모듈러 구조로 다양한 batchnorm, activation, 트레이닝트릭등(Autoaugment 등) 을 쉽게 사용할 수 있도록 짜여있습니다.

개인이 만든것이라고 생각하기에는 놀라울정도로 다양한 모델들과 아키텍쳐를 갖추고 있는데요, maintainer 본인이 여러 업무나 kaggle대회를 진행하거나 논문들을 살펴보면서 맞이하는 코드등을 쌓아가면서 코드를 형성했다고합니다.

최근에 네이버 클로버ai에서 resnet을 재구성하여 이전에 sota였던 efficientnet을 능가하는 assemblenet이 나온 후, 여기 maintainer도 tensorflow로 구성되어있던 원래의 코드를 pytorch로 구현하는 작업에 매진하고 있는 모양입니다.

현재는 selective kernel network를 구현하고 있는데요 이미 다양한 최적화, 모듈화가 가능한 자신의 코드베이스에 맞게 재구성중인 모양입니다.

저 개인적으로는 해당 assemblenet 논문의 결론부에서 언급된 ECA- Net(efficient channel attention)을 나름대로 재구성해서 해당 github에 pull request를 보내놓은 상태입니다.

이 과정에서 원래 ECA 논문(https://arxiv.org/pdf/1910.03151.pdf)
의 잠재적인 한계에 봉착하였습니다.

ECA 원문에서는 channel간의 결과를 convolution할때 kernel size에 맞게 zero padding하게 됩니다.

그러나 채널들간의 관계는 인접한 픽셀이나 feature map과는 다르기 때문에 zero padding하는게 무의미하다고 생각하였습니다.

"첫"채널이나 "마지막"채널은 기하학적이거나 위상적인 의미를 지니지 않을 것이기 때문에 더 zeropadding하여 더 적은 channel과 convolution하기보다는 circular padding을 통해 모든 채널이 같은 갯수의 서로 다른 채널들과 convolution하는게 논리적으로 맞다고 생각합니다.

그러한 논리적 접근에 따라 circular ECA module을 구현하여 함께 pull request를 하였지만, 꼭 해당 코드에만 사용해야하는 것은 아니라 이러한 attention module(ECA든 cECA든) 이 적합하다고 생각하시는 분이라면 참고해서 쓰실 수있으셨으면 좋겠습니다.

ECA PR: https://github.com/rwightman/pytorch-image-models/pull/82

해당 repo를 fork하여 eca를 구현한 제 코드

https://github.com/VRandme/pytorch-image-models/tree/eca

ECA가 SE, CBAM등 여태까지 나온 대부분의 attention model보다 우수한 성능을 보여주었는데, 저는 아직도 spatial attention을 구현한 CBAM에 미련이 남습니다.

그래서 이 코드가 정리되면 CBAM의 채널 attention부분만 ECA로 교체한 ECABAM?을 구현해볼 예정입니다.

안타깝게도 아직은 ImageNet수준의 트레이닝/테스팅을 수행할만한 여건이 되지 않아서 유의미한 진행은 어려울것같은데요, 곧 google collab에서 9.99$/mo로 무료 서비스와 차별화된 서비스를 제공한다고하니 한국에도 출시되면 알아봐야겠습니다.

관심있는 분들 살펴봐주시고 잘못된 부분등은 얼마든지 지적해주시기바랍니다. 감사합니다.

@Jong Chan Park

'Deep Learning > PyTorch' 카테고리의 다른 글

안녕하세요, 학습을 끝낸 모델을 저장했다가 다시 불러오면 accuracy가 현저히 떨어지는 문제가 발생하는데 도무지 원인을 알 수가 없어 질문 (0)	2020.04.14
PyTorch 기초 https://github.com/vahidk/EffectivePyTorch (0)	2020.04.07
Get started with PyTorch, Cloud TPUs, and Colab Joe Spisak : https://medium.com/ (0)	2020.03.18
안녕하세요 PyTorch를 시작한지 얼마 안되는 뉴비입니다. PyTorch로 대표적인 논문 모델들을 구현하며 처음 겪었던 어려웠던 점은 '레 (0)	2020.03.09
[Pytorch] 초보가 초보에게 : 어떻게 파일을 나눠야할까? Kaggle로 ML과 시각화를 하던 저에게 Pytorch는 조금 큰 장벽이었습 (0)	2020.03.08

Posted by uniqueone

,

Be the only one, not the best one

'2020/02/10'에 해당되는 글 5건

60 #Interview #Questions On #MachineLearning https://analyticsindiamag.com/60-in

'Data Science' 카테고리의 다른 글

Machine Learning From Scratch GitHub, by Erik Linder-Norén : https://github.com

Machine Learning From Scratch

About

Table of Contents

Installation

Examples

Polynomial Regression

Classification With CNN

Density-Based Clustering

Generating Handwritten Digits

Deep Reinforcement Learning

Image Reconstruction With RBM

Evolutionary Evolved Neural Network

Genetic Algorithm

Association Analysis

Implementations

Supervised Learning

Unsupervised Learning

Reinforcement Learning

Deep Learning

Contact

'Deep Learning > course' 카테고리의 다른 글

안녕하세요 Lidar SLAM공부중인 김기섭 입니다 # 지난주에 SLAM덩크 스터디에서 이종훈 님께서 레인지넷에 대해 소개해주셨는데요, 1

'SLAM > resources' 카테고리의 다른 글

안녕하세요! 최근 ImageNet 에 SOTA 를 찍은 논문입니다. https://arxiv.org/abs/1911.04252 캐글에서 많

'Deep Learning > Kaggle' 카테고리의 다른 글

Pytorch로 image classification 작업하는 분들을 위해 좋은 github을 소개합니다. https://github.com/

'Deep Learning > PyTorch' 카테고리의 다른 글

카테고리

태그목록

최근에 올라온 글

최근에 달린 댓글

글 보관함

달력

링크

티스토리툴바