Fine-Tuning LLaMA 2 with torchrun
Published:
Fine-tuning large models like LLaMA 2 is a big task, but with torchrun, you can scale it across multiple GPUs with ease. In this post, I’ll walk you through how to do it step by step.
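As a taste of what the post covers, a single-node multi-GPU launch looks like the following (a minimal sketch: the script name `train.py` and the GPU count are placeholders, and the script is assumed to initialize `torch.distributed` itself):

```shell
# Launch train.py as 4 worker processes, one per GPU; torchrun sets
# RANK, LOCAL_RANK, and WORLD_SIZE in each worker's environment.
torchrun --standalone --nproc_per_node=4 train.py
```

The `--standalone` flag lets torchrun handle rendezvous locally, so no master address or port needs to be configured for a single machine.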
Published:
High-dimensional data (tensors) appear in many fields such as scientific computing, quantum physics, and machine learning. However, storing and operating on these tensors is challenging due to the exponential growth of parameters with the number of dimensions (the so-called “curse of dimensionality”). Tensor Train (TT) decomposition is one way to represent high-dimensional tensors in a compact format by expressing them as a sequence of smaller 3D tensors (often called TT-cores).
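The sequence-of-SVDs construction behind the TT format can be sketched in a few lines of NumPy (an illustrative sketch, not code from the post; `max_rank` is a hypothetical cap on every TT-rank):

```python
import numpy as np

def tt_decompose(tensor, max_rank):
    """Split a d-dimensional array into 3D TT-cores via sequential
    truncated SVDs (the classic TT-SVD idea)."""
    dims = tensor.shape
    cores, r_prev = [], 1
    mat = tensor.reshape(r_prev * dims[0], -1)
    for k in range(len(dims) - 1):
        U, S, Vt = np.linalg.svd(mat, full_matrices=False)
        r = min(max_rank, len(S))
        # Core k has shape (rank_in, mode size, rank_out).
        cores.append(U[:, :r].reshape(r_prev, dims[k], r))
        # Fold the remaining factors into the next unfolding matrix.
        mat = (np.diag(S[:r]) @ Vt[:r]).reshape(r * dims[k + 1], -1)
        r_prev = r
    cores.append(mat.reshape(r_prev, dims[-1], 1))
    return cores

def tt_reconstruct(cores):
    """Contract the TT-cores back into a full tensor."""
    out = cores[0]
    for core in cores[1:]:
        out = np.tensordot(out, core, axes=([-1], [0]))
    return out.squeeze(axis=(0, -1))  # drop the boundary ranks of 1
```

For a rank-1 tensor (an outer product of vectors), `tt_decompose` with `max_rank=1` recovers it exactly, which makes the compression explicit: three small cores instead of the full array.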
Published:
Run the following command to download the latest Miniconda installer for Linux (adjust the link if using macOS):
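The command itself is along these lines (shown here with Anaconda's standard x86_64 Linux installer URL; swap in the macOS installer name if needed):

```shell
# Download the latest Miniconda installer for x86_64 Linux, then run it.
wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
bash Miniconda3-latest-Linux-x86_64.sh
```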
Published:
As we usually need to work on various projects, the experiment environment sometimes differs from the local environment. Docker is one option, running the code in an isolated, virtual-machine-like container. However, Conda or mamba is closer to the standard in current academia.
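The basic per-project workflow looks like this (the environment name and Python version are placeholders; `mamba` is a drop-in replacement for the `conda` command here):

```shell
# Create an isolated environment for one project, then activate it.
conda create -n myproject python=3.10 -y
conda activate myproject
```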
Published:
When using a Docker container, one may want to use VS Code's remote features to work on and debug the code. On a personal server, this guarantees a relatively clean environment together with the debugging feature.
Published:
CUDA Docker Container Setup and Usage Guide
This tutorial covers how to build, run, attach to, and detach from a CUDA-enabled Docker container supporting three NVIDIA A6000 GPUs.
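The core run step can be sketched as follows (the container name and image tag are examples; `--gpus` requires the NVIDIA Container Toolkit on the host):

```shell
# Start an interactive container with three specific GPUs visible.
docker run -it --name cuda-dev --gpus '"device=0,1,2"' \
    nvidia/cuda:12.2.0-devel-ubuntu22.04 bash
```

From inside, detach without stopping the container with Ctrl-p Ctrl-q, and reattach later with `docker attach cuda-dev`.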