TensorFlow

[python] Word Tokenization 2020.12.01
[google] Machine Learning Crash Course. 2018.03.01
[On the Scene] What Does Machine Learning Mean to a Google 'TensorFlow' Developer? 2018.02.21
[ML] TensorFlowKR/MLJejuCamp Final Presentation (2017. 07. 27 - 28) 2017.08.17
[ML] Machine Learning Jeju Camp http://mlcampjeju.com/ 2017.08.17
How to Make an Amazing Tensorflow Chatbot Easily 2017.07.28

[python] Word Tokenization

홍반장水_ 2020. 12. 1. 15:17

Word Tokenization

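In natural language processing, corpus data obtained through crawling or similar means is often not preprocessed for the task at hand. In that case it has to be tokenized, cleaned, and normalized to suit its intended use. This chapter focuses on tokenization.

Tokenization is the task of splitting a given corpus into units called tokens. What counts as a token varies with the situation, but a token is usually defined as a meaningful unit.

This chapter builds an understanding of tokenization by looking at several situations that can arise while tokenizing. It then puts tokenization into practice using Python with the NLTK package and KoNLPy.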
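Before turning to library tokenizers, it may help to see what the simplest possible word-level tokenization looks like. The following is only a sketch in plain Python (no NLTK), reusing the example sentence from this post; the variable names are made up for illustration.

```python
import string

text = ("Don't be fooled by the dark sounding name, Mr. Jone's Orphanage "
        "is as cheery as cheery goes for a pastry shop.")

# Simplest word-level tokenization: split on whitespace.
# Punctuation stays attached to the neighbouring word ("name,", "shop.").
whitespace_tokens = text.split()

# A slightly cleaner variant: strip punctuation from both ends of each token.
# Internal apostrophes (Don't, Jone's) survive, but "Mr." loses its period.
stripped_tokens = [tok.strip(string.punctuation) for tok in whitespace_tokens]

print(whitespace_tokens[:9])
print(stripped_tokens[:9])
```

Even this small example shows why the token unit is a design decision: whether "name," and "Mr." should keep their punctuation depends on what the tokens are used for.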


## word_tokenize splits Don't into Do and n't,
## and splits Jone's into Jone and 's.
>from nltk.tokenize import word_tokenize
>print(word_tokenize("Don't be fooled by the dark sounding name, Mr. Jone's Orphanage is as cheery as cheery goes for a pastry shop."))
['Do', "n't", 'be', 'fooled', 'by', 'the', 'dark', 'sounding', 'name', ',', 'Mr.', 'Jone', "'s", 'Orphanage', 'is', 'as', 'cheery', 'as', 'cheery', 'goes', 'for', 'a', 'pastry', 'shop', '.']


## WordPunctTokenizer treats punctuation as separate tokens, so unlike word_tokenize above
## it splits Don't into Don, ', and t,
## and likewise splits Jone's into Jone, ', and s.
>from nltk.tokenize import WordPunctTokenizer
>print(WordPunctTokenizer().tokenize("Don't be fooled by the dark sounding name, Mr. Jone's Orphanage is as cheery as cheery goes for a pastry shop."))
['Don', "'", 't', 'be', 'fooled', 'by', 'the', 'dark', 'sounding', 'name', ',', 'Mr', '.', 'Jone', "'", 's', 'Orphanage', 'is', 'as', 'cheery', 'as', 'cheery', 'goes', 'for', 'a', 'pastry', 'shop', '.']
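The punctuation-splitting behavior above can be approximated with a single regular expression. The sketch below uses the pattern `\w+|[^\w\s]+` (the pattern NLTK documents for WordPunctTokenizer) with only the standard `re` module; `word_punct_tokenize` is a hypothetical helper name, not an NLTK function.

```python
import re

def word_punct_tokenize(text):
    """Split text into runs of word characters and runs of punctuation."""
    # \w+      : one or more letters, digits, or underscores
    # [^\w\s]+ : one or more characters that are neither word chars nor whitespace
    return re.findall(r"\w+|[^\w\s]+", text)

print(word_punct_tokenize("Don't be fooled, Mr. Jone's shop."))
```

Because the apostrophe is neither a word character nor whitespace, it always becomes a token of its own, which is why Don't turns into three tokens.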


## Keras also provides text_to_word_sequence as a tokenization tool. Let's tokenize with Keras this time.
## By default, text_to_word_sequence lowercases every letter and removes punctuation such as
## periods, commas, and exclamation marks. The apostrophes in don't and jone's, however, are preserved.
>from tensorflow.keras.preprocessing.text import text_to_word_sequence
>print(text_to_word_sequence("Don't be fooled by the dark sounding name, Mr. Jone's Orphanage is as cheery as cheery goes for a pastry shop."))
["don't", 'be', 'fooled', 'by', 'the', 'dark', 'sounding', 'name', 'mr', "jone's", 'orphanage', 'is', 'as', 'cheery', 'as', 'cheery', 'goes', 'for', 'a', 'pastry', 'shop']
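To see why the apostrophe survives, here is a rough plain-Python approximation of that behavior. It assumes the default filter string of Keras's text_to_word_sequence, which contains most punctuation but not the apostrophe; `simple_word_sequence` is a hypothetical helper, not the Keras implementation.

```python
# Assumed default filter characters (no apostrophe among them).
FILTERS = '!"#$%&()*+,-./:;<=>?@[\\]^_`{|}~\t\n'

def simple_word_sequence(text, filters=FILTERS, lower=True):
    """Lowercase the text, replace filtered characters with spaces, then split."""
    if lower:
        text = text.lower()
    # Map every filtered character to a space so it acts as a separator.
    table = str.maketrans({ch: " " for ch in filters})
    return text.translate(table).split()

print(simple_word_sequence("Don't be fooled, Mr. Jone's shop!"))
```

Note that the hyphen is in the filter string, so under this approach a word such as home-based would be split into two tokens.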


## Penn Treebank Tokenization is one of the standard tokenization methods. Its rules are introduced here, followed by the tokenization result.
## Rule 1. Words joined by a hyphen are kept as a single token.
## Rule 2. Words with a clitic attached by an apostrophe, such as doesn't, are split.
>from nltk.tokenize import TreebankWordTokenizer
>tokenizer = TreebankWordTokenizer()
>text = "Starting a home-based restaurant may be an ideal. it doesn't have a food chain or restaurant of their own."
>print(tokenizer.tokenize(text))
['Starting', 'a', 'home-based', 'restaurant', 'may', 'be', 'an', 'ideal.', 'it', 'does', "n't", 'have', 'a', 'food', 'chain', 'or', 'restaurant', 'of', 'their', 'own', '.']
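The two rules can be illustrated with a few regular expressions. This is a deliberately simplified sketch, not the TreebankWordTokenizer implementation: `simple_treebank_like` is a hypothetical helper that handles only the n't clitic, a handful of other clitics, commas, and a period at the very end of the text.

```python
import re

def simple_treebank_like(text):
    """Very small approximation of two Penn Treebank tokenization rules."""
    # Rule 2: separate the clitic n't from its host word (doesn't -> does n't).
    text = re.sub(r"(\w+)(n't)\b", r"\1 \2", text)
    # Rule 2 (continued): separate clitics such as 's, 're, 've, 'll, 'd, 'm.
    text = re.sub(r"(\w+)('(?:s|re|ve|ll|d|m))\b", r"\1 \2", text)
    # Commas become their own tokens.
    text = text.replace(",", " , ")
    # Only a period at the very end of the text is split off.
    text = re.sub(r"\.\s*$", " .", text)
    # Rule 1: hyphens are never touched, so home-based stays one token.
    return text.split()

text = ("Starting a home-based restaurant may be an ideal. it doesn't have "
        "a food chain or restaurant of their own.")
print(simple_treebank_like(text))
```

Because only the final period is split, the mid-text token ideal. keeps its period, which matches the output shown above.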

[google] Machine Learning Crash Course.

홍반장水_ 2018. 3. 1. 13:24

Machine Learning Crash Course

with TensorFlow APIs

Google's fast-paced, practical introduction to machine learning

https://developers.google.com/machine-learning/crash-course/

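Prerequisites

Machine Learning Crash Course does not require any prior knowledge of machine learning. However, to understand the concepts presented and complete the exercises, it is recommended that you meet the following prerequisites.

Mastery of basic algebra. You should be comfortable with variables and coefficients, linear equations, graphs of functions, and histograms. Familiarity with more advanced math concepts such as logarithms and derivatives is helpful but not required.

Proficiency in programming basics and some experience coding in Python. The programming exercises in Machine Learning Crash Course are coded in Python using TensorFlow. No prior experience with TensorFlow is required, but you should be comfortable reading and writing Python code that contains basic programming constructs such as function definitions and calls, lists and dictionaries, loops, and conditional expressions.

Note: For a detailed list of the math and programming concepts used in Machine Learning Crash Course, along with reference materials for each concept, see the Key Concepts and Tools section below.

Prework

The programming exercises run directly in the browser, so no setup is required. The exercises run on the Colaboratory platform. Colaboratory is supported on most major browsers and is most thoroughly tested on the desktop versions of Chrome and Firefox. If you would rather download the exercises and run them offline, see the instructions on setting up a local environment.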

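Getting Started with Pandas

The programming exercises in Machine Learning Crash Course use the Pandas library to manipulate data sets. If you are unfamiliar with Pandas, see the Pandas quick guide, which covers the key Pandas features used in the exercises.

Low-Level TensorFlow Fundamentals

The programming exercises in Machine Learning Crash Course use the high-level tf.estimator API to configure models. If you want to build TensorFlow models from scratch, complete the following guides.

TensorFlow Hello World: 'Hello World' coded in low-level TensorFlow.
TensorFlow Programming Concepts: A tour of the basic components of a TensorFlow application: tensors, operations, graphs, and sessions.
Creating and Manipulating Tensors: A quick primer on tensors, the central abstraction in TensorFlow programming. It also covers matrix addition and multiplication from linear algebra.

Key Concepts and Tools

Machine Learning Crash Course explains and applies the following concepts and tools. For more information, see the linked resources.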

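Math

Algebra
Variables, coefficients, and functions
Linear equations. Example: $y = b + w_1x_1 + w_2x_2$
Logarithms and logarithmic equations. Example: $y = \ln(1 + e^z)$
Sigmoid function

Linear algebra
Tensors and tensor rank
Matrix multiplication

Trigonometry
Tanh (discussed as an activation function; no prior knowledge needed)

Statistics
Mean, median, outliers, and standard deviation
Ability to read a histogram

Calculus (optional, for advanced topics)
Concept of a derivative (you won't have to actually calculate derivatives)
Gradient or slope
Partial derivatives (closely related to gradients)
Chain rule (needed for a full understanding of the backpropagation algorithm for training neural networks)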
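For readers who find code easier to read than formulas, the algebra items listed above (the linear equation, the logarithmic equation, and the sigmoid function) can be written in a few lines of Python. This is only an illustrative sketch with made-up function names and input values, not material from the course.

```python
import math

def linear(b, weights, features):
    """y = b + w1*x1 + w2*x2 + ... for any number of weight/feature pairs."""
    return b + sum(w * x for w, x in zip(weights, features))

def log_equation(z):
    """y = ln(1 + e^z)."""
    return math.log(1 + math.exp(z))

def sigmoid(z):
    """Sigmoid function: 1 / (1 + e^(-z)), which maps any real z into (0, 1)."""
    return 1 / (1 + math.exp(-z))

print(linear(1.0, [2.0, 3.0], [0.5, 1.0]))  # 1 + 2*0.5 + 3*1 = 5.0
print(log_equation(0.0))                     # ln(2)
print(sigmoid(0.0))                          # 0.5
```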

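Python Programming

Basic Python

The following Python basics are covered in the Python guide.

Defining and calling functions with positional and keyword parameters

Dictionaries, lists, and sets (creating, accessing, and iterating)

for loops, including for loops with multiple iteration variables (e.g., for a, b in [(1,2), (3,4)])

if/else conditional blocks and conditional expressions

String formatting (e.g., '%.2f' % 3.14)

Variables, assignment, and basic data types (int, float, bool, str)

The pass statement

Intermediate Python

The following more advanced Python features are also covered in the Python guide.

List comprehensions

Lambda functions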
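As a quick self-check, the basic and intermediate features listed above fit into one short script. The names and values below are invented for illustration; if every line reads naturally, the Python prerequisite is probably met.

```python
# Function with one positional and one keyword parameter, plus %-formatting.
def describe(name, value=0.0):
    return '%s=%.2f' % (name, value)

# for loop with multiple iteration variables over a list of tuples.
sums = []
for a, b in [(1, 2), (3, 4)]:
    sums.append(a + b)

# Dictionary and set: create, access, iterate.
ages = {"kim": 30, "lee": 25}
names = sorted(ages)          # iterating a dict yields its keys
unique = {1, 2, 2, 3}         # duplicates collapse in a set

# Conditional expression.
label = "even" if len(sums) % 2 == 0 else "odd"

# List comprehension and lambda function.
squares = [n * n for n in range(5) if n % 2 == 0]
double = lambda x: x * 2

# pass statement as a placeholder body.
class Placeholder:
    pass

print(describe("pi", value=3.14159), sums, names, label, squares, double(21))
```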

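Third-Party Python Libraries

The code examples in Machine Learning Crash Course use the following features from third-party libraries. Prior familiarity with these libraries is not required; you can look up what you need when you need it.

Matplotlib (for data visualization)
pyplot module
cm module
gridspec module
Seaborn (for heatmaps)
heatmap function
Pandas (for data manipulation)
DataFrame class
NumPy (for low-level math operations)
linspace function
random function
array function
arange function
scikit-learn (for evaluation metrics)
metrics module

Bash Terminal / Cloud Console

To run the programming exercises on a local machine or in Cloud Console, you should be comfortable working on the command line.

Bash Reference Manual
Bash Cheatsheet
Learn Shell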

. . .

[On the Scene] What Does Machine Learning Mean to a Google 'TensorFlow' Developer?

홍반장水_ 2018. 2. 21. 07:10

http://www.bloter.net/archives/254962

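– How does TensorFlow differ from other machine learning libraries? Put another way, why should we choose TensorFlow over other libraries?

= Try every technology. Downloading other tools such as Torch and Caffe is not hard. Start by trying a range of technologies on small examples, and you will eventually find the one that fits your situation. TensorFlow is not a silver bullet that solves every problem. Its strengths are that it is a second-generation machine learning technology built by Google and that around 100 people work on it. That means technical support is active and bugs can be found and fixed faster. If you work at a small company and are deciding which technology to choose, TensorFlow will certainly be a good option, all the more so at large data scales.

– TensorFlow is not available on Windows. Are there plans to support Windows in the future?

= I have nothing definite to say about plans. But I have a Windows machine too, and I use TensorFlow. So my recommendation is to install VMware. It's free, isn't it? (laughs) Install another operating system on top of it and use TensorFlow there.

– I heard that AlphaGo used a 'reinforcement learning' algorithm. Are there plans for TensorFlow to support reinforcement learning or AlphaGo-like technology?

= TensorFlow is developed at headquarters in the United States. DeepMind develops in the United Kingdom. Both teams belong to the same company, Google, but they are far apart and in different time zones, so quick communication is difficult. The DeepMind team develops its technology on its own, so I don't know much about what DeepMind is doing. I expect an API or technical support will be possible someday. DeepMind uses Google software, including TensorFlow. I don't know how large a share of its work relies on TensorFlow or Google's proprietary technology.

– I tried something called 'TensorFlow Playground'. It is a visualization tool written in JavaScript, and I found it very interesting. A tool like this seems good for TensorFlow beginners. Do you know of similar monitoring or profiling tools that can be integrated with TensorFlow?

= Some users will have seen a TensorFlow graph suddenly slow down. Finding out what caused the slowdown requires a monitoring or profiling tool. The internal TensorFlow team does have its own profiling tool. It is a good tool, but we have not yet decided whether to release it as open source. It will be open-sourced someday, but I don't yet know when. Still, our team clearly understands what problems people want to solve. I know the TensorFlow Playground tool well. I hope everyone here tries it at least once. It makes it easy to see how each component works.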

▲ TensorFlow Playground. An external project run independently of Google.

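– I'm a macOS user. I tried to use TensorFlow, but it doesn't support AMD GPUs. Are there plans to support AMD GPUs in the future?

= I'm not aware of any plans. Because of dependency issues, changing the technology is not easy. The best suggestion I can offer right now is simply to buy an NVIDIA card. (laughs)

– What does it take to join Google's deep learning projects? Do applicants need prior work experience? Do they need something like a degree?

= A degree would help. But the TensorFlow team is actually made up of a very diverse group of people. Many have different nationalities, and not everyone graduated from a top university. The most important thing, of course, is programming skill. It matters enormously. You also need the desire to solve some problem, and an open attitude is important too. In fact, we started a program called 'Brain Residency' in November 2015. Many people applied, and 30 of them were selected to take part in the program for a year. Participants learn machine learning and carry out a project of their own choosing. Circumstances permitting, we plan to run the program again next year. I hope many of you here will apply.

Google Brain Residency Program

– I understand you specialized in speech recognition. Do you think machines will be able to surpass humans in speech recognition?

= Machines are already ahead of humans in speech recognition. Even the same sentence sounds very different depending on whether the speaker is from California or Texas. Many people have trouble understanding speech in an unfamiliar accent. Machines understand every accent. In fact, when we had the speech of an Indian speaker transcribed, the speech recognition machine understood it better than people did. Stenographers may no longer be needed in the future. There are two main reasons machines understand better: one is accents, and the other is that they have good contextual information. Parents sometimes cannot understand what a boy says to his older brother because they lack the context. Machines can. Of course, today's speech recognition apps may not perform very well, but they will improve. In particular, speech recognition services will change enormously within five to six years.

– Do you know of cases where companies have put TensorFlow to direct use?

= Many companies are downloading and using TensorFlow, but I don't know exactly what they are doing with it. Even if I knew, I couldn't talk about it here. It's a trade secret, after all. The companies interested in TensorFlow are very diverse and not confined to any particular industry.

– Are there plans to build a chatbot system with TensorFlow?

= A chatbot is a very easy technology to build. Chatbots use a very basic language structure: text comes in, the system predicts a reply, and questions and answers go back and forth. Keep training a language model that covers this sequence and you can build a chatbot. Once you grasp this concept, you will see how easy chatbots are. You could develop one with TensorFlow right now. Making a chatbot truly useful is another matter, though, because understanding a complex conversation requires a great deal of knowledge. The chatbots available today can only hold silly conversations, along the lines of 'Hello' and 'What's your name?'. They cannot understand complex conversations.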

[ML] TensorFlowKR/MLJejuCamp Final Presentation (2017. 07. 27 - 28)

홍반장水_ 2017. 8. 17. 09:29

Eunsoo Park: Multi Agent Self-Driving Simulation in Double Merge Scenario
Bingzhe Wu: LR2HR: Single Image Super Resolution via Learnable Perceptual Loss
Jaehyeon Kim: Applying Label Propagation in NMT
Elliott Wu: Sketch-Guided Text-to-Image Generation
KyuShik Min: Deep Reinforcement Learning based Self Driving Car Control
Xihan Li: One-shot Learning with Neural Turing Machines
Egil Martinsson: Weibull Time To Event RNN
Hyunwoo Song: RoadCNN: I know you well, Road.
Tommy Mulc: Speech Synthesis with GANs
Mark Kwon: Monaural Source Separation
Sally Hong: Clothing Recommender System
Gayoung Lee: Sketch Simplification using CycleGAN with Synthetic Dataset
Huseyin Coskun: Motion Generation
Jing Qiu: Attention based NMT
Rishabh Gupta: MAD-GANs
Kihoon Back: How much electricity will be produced by solar panel?
Lanfa Liu: Building Damage Mapping Using VHR Satellite Images
Sanghoon Yoon: ChangeGAN: Replaceable Representation Learning with Generative Adversarial Networks
Kiho Suh: StoryGAN: Storytelling with Artistic Images Synthesized from Storybook
Judit Acs: Morphological segmentation
Kyu-ye Song: RElational GAN (REGAN): Generating affective Text

...

[ML] Machine Learning Jeju Camp http://mlcampjeju.com/

홍반장水_ 2017. 8. 17. 09:26

Machine Learning Jeju Camp http://mlcampjeju.com/

GitHub - https://github.com/TensorFlowKR/MLJejuCamp

Call for application for Machine Learning Camp Jeju 2017

If you have studied machine learning/deep learning and TensorFlow, you probably want to implement a non-trivial and large-scale system for real use. We invite you to the month-long Machine Learning Camp Jeju 2017, where you can make that dream a reality.

For a full month in beautiful Jeju Island, you and other participants will train a deep learning model using TensorFlow from start-to-finish. Jeff Dean (Google Senior Fellow via Hangout), Rajat Monga (Google/TensorFlow Director (TBC)), and Prof. KyungHyun Cho (NYU) will give us keynote talks. Plus, you will have access to experienced mentors including Namju Kim (Head of Research for Kakao Brain), Sung Kim (HKUST), Lucy Park (TF-KR), Donghyun Kwak (TF-KR), Terry Taewoong Um (TF-KR), and many more. We hope you take advantage of this wonderful opportunity.

Those selected as participants will be provided with one round-trip airfare (up to 300 USD) to Jeju Island (South Korea), room and board at Jeju National University, USD 1,000 in stipends (can be used for the airfare, etc.) and USD 500 to 1,000 in Google Cloud Credit. In addition to these benefits, participants will gain valuable and practical experience in the field of deep learning. We look forward to your application!

Mentor Recruitment: If you’re interested in sharing your experiences and expertise with the camp, please contact us at mljejucamp@googlegroups.com. You will serve as personal mentors to 1 to 2 participants, holding 2 to 3 on/offline meetings a week to help them successfully complete their projects. While it is possible for you to provide online-only mentoring, we suggest you visit Jeju Island to meet with your mentees in person. We will provide round-trip airfare (up to USD 300) to Jeju Island and up to five (5) days of room and board.

(Information regarding the schedule, program, and benefits is subject to change as we are in the process of finalizing the details. We will have more information later.)

Camp Overview

Date: July 3rd through 28th, 2017 (Check-in date: July 2nd)
Participants: 20
Location: Jeju National University / Kakao Space.1
Organizers: TensorFlow Korea User Group, Kakao, Google, Smart Grid CK Center in Jeju National Univ, Jeju Center for Creative Economy and Innovation, Jeju Local Government (Subject to change)
Home page: https://github.com/TensorFlowKR/MLJejuCamp
Application: https://www.surveymonkey.com/r/LY29GM5 (By April 20 11:59PM AOE)
Contacts: Please leave your comments/questions on issues (https://github.com/TensorFlowKR/MLJejuCamp/issues) in this page.

Benefits (TBD)

Full month of hands-on experience training deep learning models with TensorFlow and mentorship from top developers
Round-trip airfare to Jeju Island (up to $300 USD)
Accommodation in Jeju National University or Kakao Space, Jeju
Stipend: 1,000 USD (can be used for the airfare, etc.)
Google Cloud Credit ($500~1000 TBD)

Qualification

No nationality, gender, age, degree, education requirements
Must be able to stay in Jeju Island from July 3rd to 28th. (Weekday camp programs run from 10AM to 5PM)
Good understanding of TensorFlow and deep learning and ability to train models (should be able to understand all in https://github.com/hunkim/DeepLearningZeroToAll)
Being able to release the code written during the camp publicly on github
Basic communication skills in English (All programs will be in English)

Application Closed! (By April 20 11:59PM AOE)

Detailed proposal for Deep Learning Camp Jeju 2017 project (Please be as detailed as possible)
CV that showcases applicant’s experience with deep learning and TensorFlow
Previously implemented models (GitHub or other)
Other supporting materials to show your qualification
Application link (closed): https://www.surveymonkey.com/r/LY29GM5

Proposal examples:

“I will implement paper X from 2016 NIPS Conference using TensorFlow and apply idea Y to the implementation”
“My goal is to add idea X to existing TensorFlow model Y and apply it to dataset Z” (Please justify why you are interested in the particular paper, model, dataset, etc. Make your proposal as detailed as possible, as it will be the primary criterion for selecting participants.)

Basic Tasks (but not limited to)

Each participant will implement their own deep learning related ideas and recently published ideas (in ICML, ICLR, NIPS, etc.) in TensorFlow, or adapt already implemented ideas to a new dataset. Participants will propose their plan in the application.
Participate in camp program. (10AM-5PM on weekday basis from July 1 to July 30)
Deep learning and TensorFlow expert mentors will advise each participant.
Release the code on the github as Open Source.

Camp rules

Participants may be dismissed from the camp for the following or similar reasons:

Repeatedly engaging in behavior that negatively impacts other participants' work.
Spending an unreasonable amount of time on non-camp related tasks.
Missing more than 3 camp days without proper notice.

Important dates

Application due: April 20 (AOE time zone)
Notification: May 10
Mentor assignment and online discussion: June 1
Camp starts: July 3

About Jeju

Located just off the coast of the Korean peninsula, Jeju Island is the largest volcanic island in Korea. Also known as Asia's Hawaii, the island is rife with beautiful scenery and getaway resorts. Jeju boasts several natural treasures including Mount Halla, the country's highest peak, Trail Olle that winds around the rocky coastline, and Sunrise Peak, a dormant volcano ideal for catching sunrises and sunsets. You will be able to fully experience the island's charm for as long as a month without a visa. For more information, please visit http://www.visitjeju.net/en/index.jto.

About Camp

Deep Learning Camp Jeju 2017 is a month-long program (July 3-28, 2017) where participants gain hands-on experience with TensorFlow through individual-based projects with the goal of implementing new deep learning related ideas, and/or already-published ideas. We are looking for approximately 20 participants. More than a dozen industry experts with strong backgrounds in deep learning and TensorFlow implementation will serve as project mentors to guide participants.

FAQs

Q: What are we doing during the one month program?

A: Basically, we design a deep learning model and fully implement it using TensorFlow. It is also possible to (re)implement a published paper (by others) and adapt it for new datasets. Based on this, each participant will propose their own plan in their application.

Q: What type of Visa is required for foreigners?

A: No visa is required for most countries. Please check at http://www.immigration.go.kr/HP/IMM80/imm_04/imm_p01/vm3.jsp

Q: Can I apply for partial participation? (i.e. only weekends)

A: Unfortunately No.

Q: Is this only for students?

A: No. Anyone who can spend one full month in Jeju Island in Korea, and work from 10AM-5PM during the weekdays can apply.

Q: Is this a contest?

A: No, this is not a contest. Individuals will have different projects.

Q: Is this a training or teaching program?

A: This is not specifically a teaching event. Applicants should have a good understanding of programming, machine learning/deep learning, and TensorFlow. However, we will provide mentors to assist you with your project.

Q: Will data for training be provided or is it up to participants?

A: We will provide some public data sets but participants can also utilize their own data.

Q: Should I bring my laptop?

A: We won’t provide PCs. You need to bring your laptop. However, we will provide cloud server credits.

Q: Training takes a lot of time and computing power. Does the camp provide any support?

A: We will provide cloud server credits.

Q: What are the criteria for selecting applicants?

A: There are no formal criteria, but we are looking for interesting and feasible projects.

Q: What if the attendee cannot complete the work that they submitted?

A: There is no penalty, but mentors will guide each participant to success.

Q: Can developers/researchers working on longer term projects like http://ai-on.org/projects/cardiac-mri-segmentation.html apply? In this case, one month may not be sufficient time to finish the work. Is it OK to apply?

A: It’s OK. As long as the project is interesting, we will consider it. You can also propose a small portion of a larger project that you wish to work on for a month.

Q: Is this only for deep learning? Can I propose a reinforcement learning project?

A: Yes, reinforcement learning is acceptable. Feel free to include other types of interesting machine learning projects.

Q: Can I participate in a keynote session or open seminars even though I am not a camp participant?

A: Yes, the keynote and open seminars are open to everyone. We will have a separate announcement regarding keynote sessions and open seminars.

Q: Will you also be recruiting staff members for the camp?

A: Sorry, but we have no current plans to recruit staff members.

Q: I have more questions. Whom should I contact?

A: Please use the issues page (https://github.com/TensorFlowKR/MLJejuCamp/issues) to ask questions.

How to Make an Amazing Tensorflow Chatbot Easily

홍반장水_ 2017. 7. 28. 13:50

We're going to learn about how chatbots have evolved over the years and how deep learning has made them much better. Then we'll build our own chatbot using the Tensorflow machine learning library in Python.

Code and coding challenge for this video:
https://github.com/llSourcell/tensorf ...

Georgi's winning code from this week:
https://github.com/petkofff/p_vs_np_c ...

Mick's runner-up code from this week:
https://github.com/mickvanhulst/travS ...

Join other Wizards in our Slack channel:
https://wizards.herokuapp.com

Live sequence-to-sequence chatbot demo:
http://neuralconvo.huggingface.co/

Some more useful resources on chatbots:
http://www.wildml.com/2016/04/deep-le ...
http://venturebeat.com/2016/08/01/how ...
http://web.stanford.edu/class/cs124/l ...

More resources on Tensorflow:
http://lauragelston.ghost.io/speakeas ...
https://speakerdeck.com/inureyes/buil ...

Tensorflow Chatbot

Tensorflow Chatbot Demo by @Sirajology on Youtube

Overview

This is the full code for 'How to Make an Amazing Tensorflow Chatbot Easily' by @Sirajology on Youtube. In this demo code, we implement TensorFlow's Sequence to Sequence model to train a chatbot on the Cornell Movie Dialogue dataset. After training for a few hours, the bot is able to hold a fun conversation.

Dependencies

numpy
scipy
six
tensorflow (https://www.tensorflow.org/versions/r0.12/get_started/os_setup.html)

Use pip to install any missing dependencies

Usage

To train the bot, edit the seq2seq.ini file so that mode is set to train like so

mode = train

then run the code like so

python execute.py

To test the bot during or after training, edit the seq2seq.ini file so that mode is set to test like so

mode = test

then run the code like so

python execute.py
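Switching between the two modes by hand is easy to get wrong, so the edit described above could be scripted. The sketch below uses Python's standard `configparser`; `set_mode` is a hypothetical helper, and the section name `strings` is an assumption, since the text above only says that seq2seq.ini contains a `mode` key. Check the actual file before relying on it.

```python
import configparser

def set_mode(ini_path, mode, section="strings"):
    """Set `mode` to 'train' or 'test' in the INI file and save it.

    The default section name is an assumption, not taken from the README.
    """
    if mode not in ("train", "test"):
        raise ValueError("mode must be 'train' or 'test'")
    # interpolation=None keeps any '%' characters in other values untouched.
    config = configparser.ConfigParser(interpolation=None)
    config.read(ini_path)
    if not config.has_section(section):
        config.add_section(section)
    config[section]["mode"] = mode
    with open(ini_path, "w") as f:
        config.write(f)

# Example (not executed here): set_mode("seq2seq.ini", "train"),
# then run `python execute.py` as described above.
```

One caveat of this approach: `configparser` does not preserve comments when it rewrites a file, so editing the file by hand may be preferable if the comments matter.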

...
