SKILL·9C9932

model-markov-chain

Name: model-markov-chain
Author: pjt222

pjt222

업데이트됨 1 month ago

9 조회

메타aidesign

정보

이 스킬은 이산 또는 연속 마르코프 체인을 구축하고 분석하여 전이 행렬 구성, 정상 분포 계산, 평균 최초 도달 시간 결정과 같은 작업을 가능하게 합니다. 기억이 없는 시스템 모델링, 정상 상태 확률 계산, 상태를 일시적 또는 재귀적으로 분류하는 데 사용하세요. 또한 은닉 마르코프 모델이나 강화 학습 MDP와 같은 고급 모델의 기초 역할도 합니다.

빠른 설치

Claude Code

문서

Model Markov Chain

Construct, classify, and analyze discrete-time or continuous-time Markov chains from raw transition data or domain specifications, producing stationary distributions, mean first passage times, and simulation-based validation. Covers both DTMC and CTMC workflows end-to-end.

When to Use

You need to model a system whose future state depends only on its current state (memoryless property)
You have observed transition counts or rates between a finite set of states
You want to compute long-run steady-state probabilities for a process
You need to determine expected hitting times or absorption probabilities
You are classifying states as transient, recurrent, or absorbing for structural analysis
You want to compare alternative Markov models for the same system
You are building a foundation for more advanced models (hidden Markov models, reinforcement learning MDPs)

Inputs

Required

Input	Type	Description
`state_space`	list/vector	Exhaustive enumeration of all states in the chain
`transition_data`	matrix, data frame, or edge list	Raw transition counts, a probability matrix, or a rate matrix (for CTMC)
`chain_type`	string	Either `"discrete"` (DTMC) or `"continuous"` (CTMC)

Optional

Input	Type	Default	Description
`initial_distribution`	vector	uniform	Starting state probabilities
`time_horizon`	integer/float	100	Number of steps (DTMC) or time units (CTMC) for simulation
`tolerance`	float	1e-10	Convergence tolerance for iterative computations
`absorbing_states`	list	auto-detect	States explicitly marked as absorbing
`labels`	list	state indices	Human-readable names for each state
`method`	string	`"eigen"`	Solver method: `"eigen"`, `"power"`, or `"linear_system"`

Procedure

Step 1: Define State Space and Transitions

1.1. Enumerate all distinct states. Confirm the list is exhaustive and mutually exclusive.

1.2. If working from raw observations, tabulate transition counts into an n x n count matrix C where C[i,j] is the number of observed transitions from state i to state j.

1.3. For continuous-time chains, collect holding times in each state alongside transition destinations.

1.4. Verify no state is missing from the enumeration by checking that every observed origin and destination appears in the state space.

1.5. Document the data source, observation period, and any filtering applied. This provenance record is essential for reproducing the analysis and explaining anomalies.

Got: A well-defined state space of size n and either a count matrix or a list of (origin, destination, rate/count) tuples covering all observed transitions. The state space should be small enough for matrix operations (typically n < 10000 for dense methods).

If fail: If states are missing, re-examine the source data and expand the enumeration. If the state space is too large for matrix methods, consider lumping rare states into an aggregate "other" state or switching to simulation-based analysis. If the count matrix is extremely sparse, verify the observation period is long enough to capture typical transitions.

Step 2: Construct Transition Matrix or Generator

2.1. Discrete-time (DTMC): Normalize each row of the count matrix to obtain the transition probability matrix P:

P[i,j] = C[i,j] / sum(C[i,])
Verify every row sums to 1 (within tolerance).

2.2. Continuous-time (CTMC): Construct the rate (generator) matrix Q:

Off-diagonal: Q[i,j] = rate of transition from i to j
Diagonal: Q[i,i] = -sum(Q[i,j] for j != i)
Verify every row sums to 0 (within tolerance).

2.3. Handle zero-count rows (states never observed as origins) by deciding on a smoothing strategy: Laplace smoothing, absorbing convention, or flagging for review.

2.4. Store the matrix in a format suitable for downstream computation (dense for small chains, sparse for large ones).

Got: A valid stochastic matrix P (rows sum to 1) or generator matrix Q (rows sum to 0) with no negative off-diagonal entries in P and no positive diagonal entries in Q.

If fail: If row sums deviate beyond tolerance, check for data corruption or floating-point issues. Re-normalize or re-examine source data.

Step 3: Classify States

3.1. Compute the communication classes by finding strongly connected components of the directed graph induced by the transition matrix (only edges with positive probability).

3.2. For each communication class, determine:

Recurrent if the class has no outgoing edges to other classes.
Transient if it does have outgoing edges.
Absorbing if the class consists of a single state with P[i,i] = 1.

3.3. Check periodicity for each recurrent class by computing the GCD of all cycle lengths reachable from any state in the class.

Period = 1 means aperiodic.

3.4. Determine if the chain is irreducible (single communication class) or reducible (multiple classes).

3.5. Summarize: list each class, its type (transient/recurrent), its period, and whether any absorbing states exist.

Got: A complete classification: every state assigned to a communication class with labels (transient, positive recurrent, null recurrent, absorbing) and periodicity.

If fail: If the graph analysis is inconsistent, verify the transition matrix has no negative entries and rows sum correctly. For very large chains, use iterative graph algorithms instead of full matrix powers.

Step 4: Compute Stationary Distribution

4.1. Irreducible aperiodic chain: Solve pi * P = pi subject to sum(pi) = 1.

Reformulate as pi * (P - I) = 0 with the normalization constraint.
Use eigenvalue decomposition: pi is the left eigenvector of P corresponding to eigenvalue 1, normalized to sum to 1.

4.2. Irreducible periodic chain: The stationary distribution still exists but the chain does not converge to it from arbitrary initial states. Compute it the same way as 4.1.

4.3. Reducible chain: Compute the stationary distribution for each recurrent class independently. The overall stationary distribution is a convex combination depending on absorption probabilities from transient states.

4.4. CTMC: Solve pi * Q = 0 with sum(pi) = 1.

4.5. Verify: multiply the computed pi by P (or Q) and confirm the result equals pi within tolerance.

4.6. For reducible chains, compute the absorption probabilities from each transient state to each recurrent class. These probabilities, combined with the per-class stationary distributions, give the long-run behavior conditional on starting state.

4.7. Record the spectral gap (difference between the largest and second-largest eigenvalue magnitudes). This quantity governs the rate of convergence to stationarity and is useful for determining how many simulation steps are needed in Step 6.

Got: A probability vector pi of length n with all entries non-negative, summing to 1, satisfying the balance equations within tolerance. The spectral gap should be positive for aperiodic irreducible chains.

If fail: If the eigensolver fails to converge, try iterative power method (pi_k+1 = pi_k * P until convergence). If multiple eigenvalues equal 1, the chain is reducible -- handle per Step 4.3. If the spectral gap is extremely small, the chain mixes slowly and will require very long simulations for validation.

Step 5: Calculate Mean First Passage Times

5.1. Define the mean first passage time m[i,j] as the expected number of steps to reach state j starting from state i.

5.2. For an irreducible chain, solve the system of linear equations:

m[i,j] = 1 + sum(P[i,k] * m[k,j] for k != j) for all i != j
m[j,j] = 1 / pi[j] (mean recurrence time)

5.3. For absorbing chains, compute absorption probabilities and expected times to absorption:

Partition P into transient (Q_t) and absorbing blocks.
Fundamental matrix: N = (I - Q_t)^{-1}
Expected steps to absorption: N * 1 (column vector of ones)
Absorption probabilities: N * R where R is the transient-to-absorbing block.

5.4. For CTMC, replace step counts with expected holding times using the generator matrix.

5.5. Present results as a matrix or table of pairwise first passage times for key state pairs.

Got: A matrix of mean first passage times where diagonal entries equal mean recurrence times (1/pi[j]) and off-diagonal entries are finite for communicating state pairs.

If fail: If the linear system is singular, the chain has transient states that cannot reach the target. Report unreachable pairs as infinite. Verify the chain structure from Step 3.

Step 6: Validate with Simulation

6.1. Simulate K independent sample paths of the chain for T steps each, starting from the initial distribution.

6.2. Estimate the stationary distribution empirically by counting state occupancy frequencies across all paths after discarding a burn-in period.

6.3. Compare simulated frequencies to the analytical stationary distribution. Compute the total variation distance or chi-squared statistic.

6.4. Estimate mean first passage times empirically by recording the first hitting time for each target state across replications.

6.5. Report agreement metrics:

Max absolute deviation between analytical and simulated stationary probabilities.
95% confidence intervals for simulated first passage times vs. analytical values.

6.6. If discrepancies exceed tolerance, re-examine the transition matrix construction and classification steps.

Got: Simulated stationary distribution within 0.01 total variation distance of the analytical solution (for sufficiently long runs). Simulated mean first passage times within 10% of analytical values.

If fail: Increase simulation length T or number of replications K. If discrepancies persist, the analytical solution may have numerical errors -- recompute with higher precision.

Validation

The transition matrix P has all non-negative entries and each row sums to 1 (or Q rows sum to 0 for CTMC)
The stationary distribution pi is a valid probability vector satisfying pi * P = pi
Mean recurrence times equal 1/pi[j] for each recurrent state j
Simulated state frequencies converge to the analytical stationary distribution
State classification is consistent: no recurrent state has edges leaving its communication class
All eigenvalues of P have magnitude at most 1, with exactly one eigenvalue equal to 1 per recurrent class
For absorbing chains: absorption probabilities from each transient state sum to 1 across all absorbing classes
The fundamental matrix N = (I - Q_t)^{-1} has all positive entries (expected visit counts are positive)
Detailed balance holds if and only if the chain is reversible: pi[i] * P[i,j] = pi[j] * P[j,i] for all i,j

Pitfalls

Non-exhaustive state space: Missing states produce a sub-stochastic matrix (rows sum to less than 1). Always verify row sums before analysis.
Confusing DTMC and CTMC: A rate matrix must have non-positive diagonal and rows summing to 0. Applying DTMC formulas to a rate matrix produces nonsense.
Ignoring periodicity: A periodic chain has a valid stationary distribution but does not converge to it in the usual sense. Mixing time analysis must account for period.
Numerical instability for large chains: Eigenvalue decomposition of large dense matrices is expensive and can lose precision. Use sparse solvers or iterative methods for chains with more than a few hundred states.
Zero-probability transitions: Structural zeros in the transition matrix can make the chain reducible. Verify irreducibility before computing a single stationary distribution.
Insufficient simulation length: Short simulations with poor mixing produce biased estimates. Always compute effective sample size and check trace plots.
Assuming reversibility without checking: Many analytical shortcuts (e.g., detailed balance) apply only to reversible chains. Verify pi[i] * P[i,j] = pi[j] * P[j,i] before using reversibility-dependent results.
Floating-point accumulation in power method: Iterating pi * P many times accumulates rounding errors. Periodically re-normalize pi to sum to 1 during power iteration.

Related Skills

Fit Hidden Markov Model -- extends Markov chains to latent-state models with observed emissions
Simulate Stochastic Process -- general simulation framework applicable to Markov chain sample paths and Monte Carlo validation

GitHub 저장소

pjt222/agent-almanac

경로: i18n/caveman-lite/skills/model-markov-chain

agentsagentskillsai-assisted-developmentclaude-codeskillsteams

FAQ

Frequently asked questions

What is the model-markov-chain skill?

model-markov-chain is a Claude Skill by pjt222. Skills package instructions and resources that Claude loads on demand, so Claude can perform model-markov-chain-related tasks without extra prompting.

How do I install model-markov-chain?

Use the install commands on this page: add model-markov-chain to Claude Code as a plugin, or clone its repository into your skills directory, then restart Claude so it picks up the skill.

What category does model-markov-chain belong to?

model-markov-chain is in the Meta category, tagged ai and design.

Is model-markov-chain free to use?

Yes. model-markov-chain is listed on AIMCP and free to install. It runs inside Claude, so no separate service account is required to use the skill itself.

연관 스킬

content-collections

메타

이 스킬은 콘텐츠 콜렉션(Content Collections)을 위한 프로덕션 검증된 설정을 제공합니다. 콘텐츠 콜렉션은 Markdown/MDX 파일을 Zod 검증이 포함된 타입 안전한 데이터 콜렉션으로 변환해주는 TypeScript 최우선 도구입니다. 블로그, 문서 사이트 또는 콘텐츠 중심의 Vite + React 애플리케이션을 구축할 때 타입 안전성과 자동 콘텐츠 검증을 보장하기 위해 사용하세요. Vite 플러그인 구성과 MDX 컴파일부터 배포 최적화 및 스키마 검증에 이르기까지 모든 것을 다룹니다.

스킬 보기

polymarket

메타

이 스킬은 개발자들이 Polymarket 예측 시장 플랫폼을 활용한 애플리케이션을 구축할 수 있도록 지원하며, 거래 및 시장 데이터를 위한 API 통합 기능을 포함합니다. 또한 WebSocket을 통한 실시간 데이터 스트리밍을 제공하여 실시간 거래와 시장 활동을 모니터링할 수 있습니다. 이를 통해 거래 전략을 구현하거나 실시간 시장 업데이트를 처리하는 도구를 생성하는 데 활용할 수 있습니다.

스킬 보기

creating-opencode-plugins

메타

이 스킬은 개발자들이 명령어, 파일, LSP 작업 등 25개 이상의 이벤트 유형에 연결되는 OpenCode 플러그인을 만들 수 있도록 돕습니다. JavaScript/TypeScript 모듈을 위한 플러그인 구조, 이벤트 API 명세, 구현 패턴을 제공합니다. OpenCode AI 어시스턴트의 라이프사이클을 사용자 정의 이벤트 기반 로직으로 가로채거나, 모니터링하거나, 확장해야 할 때 사용하세요.

스킬 보기

sglang

메타

SGLang은 RadixAttention 프리픽스 캐싱을 활용하여 JSON, 정규식, 에이전트 워크플로우를 위한 고속 구조화 생성에 특화된 고성능 LLM 서빙 프레임워크입니다. 특히 반복되는 프리픽스가 있는 작업에서 상당히 빠른 추론 속도를 제공하여 복잡한 구조화 출력 및 다중 턴 대화에 이상적입니다. 제약 디코딩이 필요하거나 광범위한 프리픽스 공유가 있는 애플리케이션을 구축할 때는 vLLM과 같은 대안보다 SGLang을 선택하십시오.

스킬 보기