[Python] The difference between python -m and running a script directly with python

class Student:
    
    # this is the getter
    @property
    def score(self):
        return self._score
    
    # this is the setter
    @score.setter
    def score(self, value):
        # value check here
        self._score = value

When calling

Student().score  # calls the getter, instead of accessing Student()._score directly
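To make the "value check here" placeholder concrete, here is a minimal sketch of a setter that validates its input. The integer type check, the 0 to 100 range, and the `__init__` default are assumptions chosen for illustration; they are not part of the original snippet.

```python
class Student:
    def __init__(self, score=0):
        # Assigning through the property runs the setter's checks (assumed default of 0).
        self.score = score

    @property
    def score(self):
        # Getter: read the value stored on the private attribute.
        return self._score

    @score.setter
    def score(self, value):
        # Assumed validation rules: an integer between 0 and 100.
        if not isinstance(value, int):
            raise TypeError("score must be an integer")
        if not 0 <= value <= 100:
            raise ValueError("score must be between 0 and 100")
        self._score = value


s = Student()
s.score = 90
print(s.score)  # 90

try:
    s.score = 101
except ValueError as err:
    print(err)  # score must be between 0 and 100
```

Because the check lives in the setter, callers keep using plain attribute syntax and an invalid assignment leaves the previous value untouched.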

More details about the built-in property function here

Reference

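[Essay] Don't Be Afraid of Making Mistakes

Posted on 2019-05-14 | In Essays

If we always insist on being perfect and getting everything right in one go, we are really just avoiding growth.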

txtaug: text augmentation command line tool in Python

Posted on 2019-04-23 | Edited on 2019-04-29 | In Personal Projects

txtaug

txtaug is a command line tool written in Python 3 for text augmentation based on the Baidu translation API.

Crawling Twitter Search Results

Posted on 2019-04-17 | In Personal Projects

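Requirement

I want to collect a large number of (almost all) tweets that match given search criteria.

Current Situation

Twitter's own API has many restrictions. The free API returns at most 100 tweets per request and only covers tweets from the last 7 days. The paid API can search the full archive, but its time window is limited to 30 days, i.e. you can only retrieve tweets that fall within a single 30-day window of the whole archive…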

Translating with Microsoft Bing Translator in a Crawler-like Way

Posted on 2019-04-17 | In Personal Projects

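Requirement

I recently had to process tweets related to the Hong Kong MTR. Besides English, the tweets contain Chinese, Japanese, Thai and other languages, and a single sentence often mixes several languages (for example, "I am at 香港站").

I want to convert all of the non-English parts into English.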
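One way to picture the task is the sketch below, which finds the non-English runs in a sentence and passes only those to a translator. It rests on two loud assumptions: every run of non-ASCII characters counts as "non-English" (so accented Latin letters would be caught too), and `fake_translate` is a hypothetical stand-in for a real translation call, not the Bing Translator interface.

```python
import re

# Assumption: any run of non-ASCII characters is treated as non-English text.
NON_ASCII_RUN = re.compile(r"[^\x00-\x7F]+")


def translate_non_english(text, translate):
    """Replace each non-ASCII run in text with translate(run)."""
    return NON_ASCII_RUN.sub(lambda match: translate(match.group(0)), text)


def fake_translate(segment):
    # Hypothetical stand-in for a real translation service call.
    lookup = {"香港站": "Hong Kong Station"}
    return lookup.get(segment, segment)


result = translate_non_english("I am at 香港站", fake_translate)
print(result)  # I am at Hong Kong Station
```

Passing the translator in as a function keeps the span detection separate from whichever translation backend is actually used.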

Public Perception Analysis on Social Media

Posted on 2019-04-02 | Edited on 2019-04-05

Basic Information

Student: ZHU Xingye (Joseph)
Supervisor: Prof. Francis C.M. Lau

Dissertation Abstract

Public perception analysis helps improve services and detect issues. This project conducts sentiment analysis and topic labelling tasks on Hong Kong MTR related tweets under a Siemens application scenario and compares the algorithms adopted in each task. For sentiment analysis, we apply traditional deep neural networks such as RNN and CNN to a massive public sentiment dataset. For topic labelling, we crawled, labelled and augmented our own dataset and adopted recent transfer learning techniques such as BERT and ULMFiT. For both tasks, we use FastText, a light yet powerful and fast text classification algorithm, as the baseline.

In our experiments, RNN and ULMFiT achieved the best performance in the sentiment analysis and topic labelling tasks respectively. Our experiments suggest that feature extraction determines model performance, while the most suitable level of feature extraction depends on the dataset (size, quality, etc.) and the label categories. Under-extraction, as with CNN, or over-extraction, as with BERT, can each lead to worse performance. Also, introducing transfer learning to NLP tasks in public perception analysis is promising, especially when labelled samples are limited.

Joseph Chu

What one day's work can achieve is limited, but little by little it adds up.