시간여행자: python 에서 특수문자 유니코드 변환 #remove_non

2016년 9월 6일 화요일

python 에서 특수문자 유니코드 변환 #remove_non_ascii

유니코드를 변형하기 위해 다음과 같은 놈을 찾았다.

For you the get the most alike representation of your original string I recommend:

from unidecode import unidecode
def remove_non_ascii(text):
    #return unidecode(unicode(text, encoding = "utf-8")) #난 3.5니 text를 바로..
    return unidecode( text )

Then you can use it in a string:

1 2	remove_non_ascii("Ceñía") Cenia

오? 멋진데?
해서 혹시나 다른 언어를 시도해봤다.

1 2	remove_non_ascii("דותן") dvtn

오호?!?! 일본어는?!

1 2	remove_non_ascii("ありがとう") airigatou

캬~~~~~~~ 이거쓰자!

( … )

어 -_-? 잠깐만.
한국어는 -_-?

1 2	remove_non_ascii("한글도?") hangeuldo?

에라~ 그지같은.. -_-++ 이게 뭔 개그냐;

관두자 관둬~ unicode 하나씩 비교한다내가!

시간여행자

2016년 9월 6일 화요일

python 에서 특수문자 유니코드 변환 #remove_non_ascii

댓글 없음:

댓글 쓰기