Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Posts

Agent Murphy

10 minute read

Published: March 04, 2026

Recently my research lab built a custom agent running on a spare lower-end Mac mini box that my mentor happens to have. We call it Murphy, in defiant spirit to Murphy’s law and tribute to the beloved Interstellar.

Let’s talk about reward and value

7 minute read

Published: December 09, 2025

I will skip the notation and introduction on RL as a Markov Decision Process, for that you can refer to this blog. Here is a checklist of questions to ask yourself before proceeding:

what $J(\theta)$ and $\nabla_{\theta} J(\theta)$ (recall policy gradient theorem) look like?
what is $Q$, $A$, $V$?
do you have 10 minutes to spare?

What is the architecture of an ideal agent?

4 minute read

Published: November 22, 2025

Here is a beta version of an ideal agent that I have been thinking about, this helps me personally to categorize a large amout of new ML research paper to one of the boxes or arrows, and to identify gaps.

portfolio

publications

My paper

Published in GitHub Journal of Bugs, 2024

This paper is about XXX. I don’t have a paper yet.

Recommended citation: Your Name, You. (2024). "Paper Title Number 3." GitHub Journal of Bugs. 1(3). http://academicpages.github.io/files/paper3.pdf

talks

Conference Proceeding talk 3 on Relevant Topic in Your Field

Published: March 01, 2014

This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.

teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.

Zhi Wang

Sitemap

Pages

Page Not Found

Hi! I'm Zhi.

Posts by Category

Posts by Collection

CV

Page Archive

Portfolio

Publications

Research

Sitemap

Posts by Tags

Talks and presentations

Teaching

Terms and Privacy Policy

Blog

Jupyter notebook markdown generator

Posts

Agent Murphy

Let’s talk about reward and value

What is the architecture of an ideal agent?

portfolio

publications

My paper

talks

Conference Proceeding talk 3 on Relevant Topic in Your Field

teaching

Teaching experience 1

Teaching experience 2