1. Intro
In this project, an artificial intelligence agent was developed to compete in the board game Azul in a two-player scenario.
Azul is a zero-sum, turn-taking, multi-player game with perfect information that is mostly
deterministic (the exception being the random restocking of the factory displays at the start of each round).
In Azul, players take turns drafting tiles from the factory displays and placing them on their pattern lines.
At the end of each round, players move tiles from their completed pattern lines onto their walls and score points
(see the official rules for full details).
To build an agent for this game, we developed a game-theoretic
minimax agent that uses a heuristic function to estimate the reward of a given state.
We also explored several other algorithms, including
depth-first search, a shallow greedy search, and Monte Carlo Tree Search.
The following sections introduce the details of each algorithm,
how we built the tournament agent, an evaluation of its performance, and a discussion of our agent within the Azul framework.
2. Attempted Agents
Figure 1. Brief overview of all agents (Greedy, MCTS, and Minimax are introduced in the following sections)
2.1 Simple Greedy Search
As stated in the depth-first search section,
blind search was not a good approach when creating an agent to play Azul due to the enormous state space.
In order to utilise domain knowledge, a simple greedy agent was implemented.
Given a game state and a set of legal moves,
a heuristic function H(s, a) returning a real number was designed,
where s is a game state and a is a legal action. The agent would then play a move via the following policy:
select the legal action a that maximises H(s, a).
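A minimal sketch of this argmax policy is shown below; `heuristic` stands in for H(s, a), and the names used are illustrative rather than the framework's actual API.

```python
def greedy_policy(state, legal_moves, heuristic):
    """Return the legal action a that maximises H(s, a); ties break arbitrarily."""
    return max(legal_moves, key=lambda a: heuristic(state, a))
```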
2.1.1 Heuristic function
This heuristic function attempted to estimate how 'good' a move was to make.
It applied a move a to state s,
used the game state's score function to simulate an end of round from the resulting state,
and returned the player's score, thereby scoring the move a.
The heuristic also took into consideration the number of tiles a player would take from a factory,
scoring moves that take more tiles from a factory higher than moves that take fewer.
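As a rough illustration (not the exact implementation), the heuristic can be sketched as an end-of-round simulation plus a small bonus for the number of tiles taken; `apply`, `score_end_of_round`, `num_tiles_taken`, and the weight are hypothetical stand-ins for the framework's game model.

```python
import copy

def heuristic(state, action, player_id, tile_weight=0.1):
    """Sketch of H(s, a): apply the move on a copy of the state, simulate the
    end-of-round scoring, and add a small bonus for the number of tiles taken."""
    next_state = copy.deepcopy(state)                    # never mutate the live game state
    next_state.apply(action, player_id)                  # play the candidate move (hypothetical call)
    round_score = next_state.score_end_of_round(player_id)  # simulated end-of-round score
    return round_score + tile_weight * action.num_tiles_taken
```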
2.1.2 Results
Over 1000 games, this greedy player (using heuristic v1.0) achieved a win rate of 82.20% against the naive_player.
As a result, this agent was used as the baseline agent for testing the other approaches.
A downside to this approach is that it only considers its current move and does not look ahead to guide its decision.
2.2 Monte Carlo Tree Search (UCT)
2.2.1 State space reasoning
When choosing approaches,
it was important to consider the state and action space of Azul.
Consider just the factory displays: in a two-player game of Azul there are 5 factory displays with up to four tiles on each display.
At the start of each round, 20 tiles are drawn from a tile bag initially containing 100 tiles (20 of each colour) and distributed across the 5 factory displays.
Using the formula for combinations with repetition,
there are 70 different ways to fill a display with 4 tiles drawn from 5 colours.
Choosing 5 factory displays from these 70 fillings (again with repetition) gives 16,108,764 possible initial round states.
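These counts can be reproduced with the multiset coefficient C(n + k - 1, k), ignoring the finite bag as the text does; a quick check in Python:

```python
from math import comb

def multiset(n, k):
    """Combinations with repetition: C(n + k - 1, k)."""
    return comb(n + k - 1, k)

per_display = multiset(5, 4)               # 4 tiles drawn from 5 colours -> 70
initial_states = multiset(per_display, 5)  # 5 displays drawn from those 70 fillings
print(per_display, initial_states)         # 70 16108764
```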
Now consider a single start configuration of an Azul game with empty pattern lines,
and assume that each factory display contains 4 tiles of different colours.
There are then 20 different ways to choose tiles from the factory displays (5 displays x 4 colours).
Since each choice can be placed into one of the 5 pattern lines or the floor line,
there are 6 different ways to play it - therefore there are 120 actions that the player can play.
Suppose that we apply one action and the resulting game state has 80 legal actions to play;
considering just these two moves already gives 120 x 80 = 9600 possible game states.
The state space of Azul therefore explodes very quickly,
and a full tree search spanning multiple rounds would take far too long to compute.
The search technique therefore had to be smart about which moves it chose to explore.
It was decided that a simulation-based Monte Carlo Tree Search would be implemented in order to try to pick a feasible move to make.
Furthermore, the strict evaluation time limit was not a problem for MCTS, as it is an anytime algorithm.
2.2.2 MCTS Implementation
The variant of MCTS implemented was the Upper Confidence Trees (UCT) algorithm (Browne et al., 2012), using UCB1 to balance exploration and exploitation of the game tree.
At the end of a playout, the reward returned was the player's score for that round,
or, if the playout reached the end of the game, a large constant multiple of the difference in score between our player and the opponent.
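For reference, the UCB1 selection rule can be sketched as follows; `node.children`, `child.visits`, and `child.value` are hypothetical attribute names, not the exact ones used in our implementation.

```python
import math

def ucb1_select(node, c=math.sqrt(2)):
    """Select the child maximising the UCB1 score (exploitation + exploration)."""
    def ucb1(child):
        if child.visits == 0:
            return float("inf")                # expand unvisited children first
        exploit = child.value / child.visits   # mean reward observed through this child
        explore = c * math.sqrt(math.log(node.visits) / child.visits)
        return exploit + explore
    return max(node.children, key=ucb1)
```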
MCTS was first implemented using full-game rollouts; over 100 games against the naive_player,
a win rate of 38% was achieved.
Due to the stochasticity introduced by the randomisation of the factory displays,
each rollout would have a different factory display configuration, leading to wide variability in the games being played.
There was a very low chance that two rollouts would produce games with the same factory display configurations,
so we believe MCTS was not able to converge to a reasonable reward value.
In order to try and increase the win rate,
rollouts were terminated at the end of the current round, so MCTS would only predict moves based on the current round.
As a result, a win rate of 73% against the naive agent was observed over 100 games.
The performance of MCTS is heavily dependent on the number of simulations it is able to perform.
Our implementation was able to expand between 200-600 nodes/sec; considering the state space of Azul,
MCTS is not exploring enough of the tree.
A significant bottleneck with this implementation of MCTS and the implementation of the game model
is the deepcopying of game states to produce child nodes - a slow process that significantly limits the number of nodes we can expand.
The following table shows the evaluation of our MCTS agent against three agents under different evaluation time limits.
| Evaluation Time | Random Agent | Naive Agent | Greedy Agent |
| --- | --- | --- | --- |
| 200 ms | 100% | 42% | 12% |
| 500 ms | 100% | 63% | 19% |
| 1000 ms | 100% | 73% | 27% |
Table 1. Win rate of MCTS against different agents across various evaluation time limits
MCTS is capable of producing relatively well-informed actions and can beat agents up to the naive player,
but it breaks down when it faces the purely greedy player.
As we increase the evaluation time,
the win rate against the greedy player does increase,
but it remains far from acceptable.
Unlike minimax, MCTS does not assume that the opponent is behaving rationally,
but this alone cannot explain the result.
More likely, the number of states MCTS evaluates is simply not enough for it to make a confident decision about which move to make.
Furthermore, as simulations only run to the end of the round,
the MCTS agent cannot reason about moves beyond the round it is currently playing.
To try to increase the performance of the agent,
a lightweight rollout was then tried,
using the naive_player policy for the opponent agent.
With this rollout, the win rate of the MCTS agent increased to 30%.
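A sketch of this round-limited, lightweight rollout is given below; it assumes random moves for our own side (not specified above), and `is_round_over`, `current_player`, `get_legal_moves`, `apply`, and `round_score` are hypothetical stand-ins for the framework's API.

```python
import random

def light_rollout(state, our_id, naive_policy):
    """Round-limited playout: our moves are random, the opponent is modelled
    by the naive_player policy; the reward is our score for this round."""
    while not state.is_round_over():
        player = state.current_player()
        moves = state.get_legal_moves(player)
        if player == our_id:
            action = random.choice(moves)        # cheap, uninformed playout for our side
        else:
            action = naive_policy(state, moves)  # opponent follows naive_player's policy
        state.apply(action, player)
    return state.round_score(our_id)
```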
2.3 Final Agent - (α-β) Minimax with heavy move pruning and move ordering
2.3.1 Minimax description
Azul is a mostly deterministic (except for round initialisation) two-player adversarial game with perfect information.
To leverage this, a minimax agent was developed.
The minimax algorithm is a game-theoretic algorithm that assumes the opponent is also playing rationally.
Here, playing rationally means that the agent chooses the move which maximises its reward given that the opponent is trying to minimise that reward.
The minimax algorithm searches over the same state space as described in 2.2.1.
As a result, minimax inherently takes the opponent's moves into consideration, unlike the greedy agent we designed.
Pure minimax is an exhaustive tree search algorithm with exponential time complexity and hence is not suitable for Azul.
Instead of building an entire game tree, the state space was limited to the end of the current round,
making each search fully deterministic, since we do not need to consider randomised factory displays in new rounds.
2.3.2 Optimisations
An optimisation technique called alpha-beta pruning (Knuth & Moore, 1975) was implemented --
the main idea is that lower- and upper-bound utilities are propagated to the relevant nodes,
allowing nodes that cannot affect the result to be pruned. In the best case, the time complexity of alpha-beta is O(b^(d/2)),
and in the average case O(b^(3d/4)) (Russell & Norvig, 2002).
The best case corresponds to perfect move ordering (using an oracle to infer the minimax values of states) when exploring the tree.
Furthermore, a depth-cutoff variant was implemented,
whereby if a node was at the depth cutoff, the 'goodness' of that state was estimated using a utility function.
Even with these optimisations and heuristics, the agent was not able to evaluate moves to a depth of 2 within the given time frame.
The following changes were therefore made: first, we ordered the set of actions by scoring each move with the utility function,
and then we greedily took only the top n moves to search at each depth.
This approximates move ordering and aggressively prunes the tree, significantly decreasing the width of each level, similar to a beam search.
The final minimax agent is therefore not a purely optimal agent, as we are greedy in the choice of moves we expand,
thereby discarding a proportion of the search tree in which the true optimal minimax value could lie.
Our model therefore has two hyperparameters: the depth d we want to explore,
and the width w representing the top w moves we expand at any given state.
The width and depth of the agent were determined experimentally,
by choosing the values of w and d that gave the best win rate against the greedy agent.
Our final agent was set to explore to a depth of 4 moves using a maximum move width of 5.
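The combination of alpha-beta, depth cutoff, and greedy width limiting can be sketched as below; `get_legal_moves`, `peek` (returning a successor state), and `is_round_over` are hypothetical stand-ins for the game model, not our exact code.

```python
def alphabeta(state, depth, alpha, beta, maximising, utility, width=5):
    """Depth-limited alpha-beta that only expands the top `width` moves
    at each node, ranked greedily by the utility function (beam-like pruning)."""
    if depth == 0 or state.is_round_over():
        return utility(state)  # estimate the 'goodness' of the state at the cutoff
    # Generate successors, order them by utility, and keep only the best `width`.
    children = [(a, state.peek(a)) for a in state.get_legal_moves()]
    children.sort(key=lambda ac: utility(ac[1]), reverse=maximising)
    children = children[:width]
    if maximising:
        value = float("-inf")
        for _, child in children:
            value = max(value, alphabeta(child, depth - 1, alpha, beta, False, utility, width))
            alpha = max(alpha, value)
            if alpha >= beta:
                break  # beta cutoff: the minimiser will never allow this branch
        return value
    value = float("inf")
    for _, child in children:
        value = min(value, alphabeta(child, depth - 1, alpha, beta, True, utility, width))
        beta = min(beta, value)
        if beta <= alpha:
            break  # alpha cutoff: the maximiser already has a better option
    return value
```

Under these assumptions, the final agent would correspond to a call such as alphabeta(root, depth=4, alpha=-inf, beta=+inf, maximising=True, utility=H, width=5).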
2.3.3 Utility function
The performance of the minimax agent is largely determined by its utility function.
Initially, the minimax agent reused the heuristic function from the greedy player:
essentially, the utility function scored each state as if it were the final state of the round.
For example, given all-empty wall and pattern lines,
if a player made a move that completed an entire pattern line,
the utility function would return a value of 1, since it would result in placing one tile on the wall.
In this way, the utility function prefers moves that fill an entire pattern line,
and prefers filling lines whose wall placement lands next to existing tiles on the wall.
This utility function served as the baseline utility function for our minimax agent.
2.3.4 Utility function design
Several utility functions were tried and tested for this agent, but all failed to outperform the baseline utility function.
Because the baseline utility function deepcopied the game state, it was inherently slow;
since the search is exponential in time complexity,
this results in an exponential number of deepcopy calls, introducing a major bottleneck for minimax.
An improved version was therefore attempted that
inferred the score of a move without deepcopying the state.
Furthermore, attempts to capture end-game scoring bonuses were also included as extra features.
On average this heuristic function evaluated a game state around 20 times faster than the baseline.
Despite this, it achieved only a 40% win rate against the greedy player, failing to outperform the baseline heuristic, which had a win rate of 63%.
This could be attributed to sub-optimal weight assignments in the function (placing too much weight on the future/bonus reward features rather than the immediate reward).
| Version | Heuristic v1 | Heuristic v2 |
| --- | --- | --- |
| Time (s) | 0.00139 | 0.00006412 |
Table 2. Computation time per state evaluation for each heuristic
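As an illustration of the kind of incremental estimate heuristic v2 aimed for (not the actual code), the standard Azul wall-placement score can be computed directly from a 5x5 boolean wall grid without copying the game state:

```python
def placement_score(wall, row, col):
    """Points for placing a tile at (row, col) on a 5x5 wall, computed in place:
    1 point, or the lengths of the contiguous row and column runs the tile joins."""
    def run_length(dr, dc):
        length, r, c = 0, row + dr, col + dc
        while 0 <= r < 5 and 0 <= c < 5 and wall[r][c]:
            length += 1
            r, c = r + dr, c + dc
        return length
    horizontal = run_length(0, -1) + run_length(0, 1)
    vertical = run_length(-1, 0) + run_length(1, 0)
    score = 0
    if horizontal:
        score += horizontal + 1   # the placed tile counts in the row run
    if vertical:
        score += vertical + 1     # and in the column run
    return score if score else 1  # an isolated tile scores 1 point
```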
2.3.5 Utility function discussion
Designing the utility function for minimax was deemed the most challenging part of the project.
Not only did we have to find strong features for Azul, but the features we used were either hard to express in code,
difficult or slow to compute, or we could not optimise their weights well enough to perform well against the baseline.
Overall, fine-tuning the weights of the minimax utility function was done experimentally by letting the agent play multiple games against the greedy agent.
This was not an efficient way of adjusting utility weights; we could instead have used temporal-difference learning such as TDLeaf (Baxter et al., 1999).
3. Results
Using the baseline utility function,
the following table outlines the win rate of the minimax agent against the other agents, either developed by us or provided with the framework.
| vs. | Random Agent | Naive Agent | Greedy Agent | MCTS (UCT) |
| --- | --- | --- | --- | --- |
| Minimax | 100% | 81% | 63% | 70% |
Table 3. Win rate of the minimax agent against the other agents
Here we can see that this agent outperforms all of the other agents developed so far,
winning the majority of games against each of them; because of this,
it was chosen as the final agent to play the game.
As the table shows,
our final minimax agent was able to beat both the greedy agent and the MCTS agent,
and both the random and naive agents were also comfortably outperformed.
Section 4 outlines the evaluation of our agent against other AI agents and staff-created agents.
4. Tournament
Figure 2. Tournament result (rank 8 out of 64)
References
- Baxter, J., Tridgell, A., & Weaver, L. (1999). TDLeaf(lambda): Combining temporal difference learning with game-tree search. arXiv preprint cs/9901001.
- Browne, C. B., Powley, E., Whitehouse, D., Lucas, S. M., Cowling, P. I., Rohlfshagen, P., Tavener, S., Perez, D., Samothrakis, S., & Colton, S. (2012). A survey of Monte Carlo tree search methods. IEEE Transactions on Computational Intelligence and AI in Games, 4(1), 1-43.
- Knuth, D. E., & Moore, R. W. (1975). An analysis of alpha-beta pruning. Artificial Intelligence, 6(4), 293-326.
- Russell, S., & Norvig, P. (2002). Artificial Intelligence: A Modern Approach.