fail-early-pattern
정보
이 스킬은 개발자에게 가드 클로즈와 어서션을 사용하여 입력값을 검증하고 즉시 오류를 보고하는 '빠른 실패(fail-fast)' 패턴 구현 방법을 가르칩니다. R 언어의 실용적 예시와 함께 강력한 함수 작성 및 API 강화를 위한 크로스 언어 가이드를 제공합니다. 코드 리팩토링, 오류 처리 검토, 또는 무음 실패를 방지하기 위해 프로덕션 환경에 코드를 준비할 때 활용하세요.
빠른 설치
Claude Code
추천npx skills add pjt222/agent-almanac -a claude-code/plugin add https://github.com/pjt222/agent-almanacgit clone https://github.com/pjt222/agent-almanac.git ~/.claude/skills/fail-early-patternClaude Code에서 이 명령을 복사하여 붙여넣어 스킬을 설치하세요
문서
Fail Early
If something is going to fail, it should fail as early as possible, as loudly as possible, with as much context as possible. This skill codifies the fail-early pattern: validating inputs at system boundaries, using guard clauses to reject bad state before it propagates, and writing error messages that answer what failed, where, why, and how to fix it.
When to Use
- Writing or reviewing functions that accept external input (user data, API responses, file contents)
- Adding input validation to package functions before CRAN submission
- Refactoring code that silently produces wrong results instead of erroring
- Reviewing pull requests for error-handling quality
- Hardening internal APIs against invalid arguments
Inputs
- Required: Function or module to apply the pattern to
- Required: Identification of trust boundaries (where external data enters)
- Optional: Existing error-handling code to refactor
- Optional: Target language (default: R; also applies to Python, TypeScript, Rust)
Procedure
Step 1: Identify Trust Boundaries
Map where external data enters the system. These are the points that need validation:
- Public API functions (exported functions in an R package)
- User-facing parameters
- File I/O (reading configs, data files, user uploads)
- Network responses (API calls, database queries)
- Environment variables and system configuration
Internal helper functions called only by your own validated code generally do not need redundant validation.
Got: A list of entry points where untrusted data crosses into your code.
If fail: If boundaries are unclear, trace backwards from errors in logs or bug reports to find where bad data first entered.
Step 2: Add Guard Clauses at Entry Points
Validate inputs at the top of each public function, before any work begins.
R (base):
calculate_summary <- function(data, method = c("mean", "median", "trim"), trim_pct = 0.1) {
# Guard: type check
if (!is.data.frame(data)) {
stop("'data' must be a data frame, not ", class(data)[[1]], call. = FALSE)
}
# Guard: non-empty
if (nrow(data) == 0L) {
stop("'data' must have at least one row", call. = FALSE)
}
# Guard: argument matching
method <- match.arg(method)
# Guard: range check
if (!is.numeric(trim_pct) || trim_pct < 0 || trim_pct > 0.5) {
stop("'trim_pct' must be a number between 0 and 0.5, got: ", trim_pct, call. = FALSE)
}
# --- All guards passed, begin real work ---
# ...
}
R (rlang/cli — preferred for packages):
calculate_summary <- function(data, method = c("mean", "median", "trim"), trim_pct = 0.1) {
rlang::check_required(data)
if (!is.data.frame(data)) {
cli::cli_abort("{.arg data} must be a data frame, not {.cls {class(data)}}.")
}
if (nrow(data) == 0L) {
cli::cli_abort("{.arg data} must have at least one row.")
}
method <- rlang::arg_match(method)
if (!is.numeric(trim_pct) || trim_pct < 0 || trim_pct > 0.5) {
cli::cli_abort("{.arg trim_pct} must be between 0 and 0.5, not {.val {trim_pct}}.")
}
# ...
}
General (TypeScript):
function calculateSummary(data: DataFrame, method: Method, trimPct: number): Summary {
if (data.rows.length === 0) {
throw new Error(`data must have at least one row`);
}
if (trimPct < 0 || trimPct > 0.5) {
throw new RangeError(`trimPct must be between 0 and 0.5, got: ${trimPct}`);
}
// ...
}
Got: Every public function opens with guard clauses that reject invalid input before any side effects or computation.
If fail: If validation logic is getting long (>15 lines of guards), extract a validate_* helper or use stopifnot() for simple type assertions.
Step 3: Write Meaningful Error Messages
Every error message should answer four questions:
- What failed — which parameter or operation
- Where — function name or context (automatic with
cli::cli_abort) - Why — what was expected vs. what was received
- How to fix — when the fix is non-obvious
Good messages:
# What + Why (expected vs. actual)
stop("'n' must be a positive integer, got: ", n, call. = FALSE)
# What + Why + How to fix
cli::cli_abort(c(
"{.arg config_path} does not exist: {.file {config_path}}",
"i" = "Create it with {.run create_config({.file {config_path}})}."
))
# What + context
cli::cli_abort(c(
"Column {.val {col_name}} not found in {.arg data}.",
"i" = "Available columns: {.val {names(data)}}"
))
Bad messages:
stop("Error") # What failed? No idea
stop("Invalid input") # Which input? What's wrong with it?
stop(paste("Error in step", i)) # No actionable information
Got: Error messages are self-documenting — a developer seeing the error for the first time can diagnose and fix it without reading source code.
If fail: Review the three most recent bug reports. If any required reading source code to understand, their error messages need improvement.
Step 4: Prefer stop() Over warning()
Use stop() (or cli::cli_abort()) when the function cannot produce a correct result. Use warning() only when the function can still produce a meaningful result but the caller should know about a concern.
Rule of thumb: If a user could silently get a wrong answer, that is a stop(), not a warning().
# CORRECT: stop when result would be wrong
read_config <- function(path) {
if (!file.exists(path)) {
stop("Config file not found: ", path, call. = FALSE)
}
yaml::read_yaml(path)
}
# CORRECT: warn when result is still usable
summarize_data <- function(data) {
if (any(is.na(data$value))) {
warning(sum(is.na(data$value)), " NA values dropped from 'value' column", call. = FALSE)
data <- data[!is.na(data$value), ]
}
# proceed with valid data
}
Got: stop() is used for conditions that would produce incorrect results; warning() is reserved for degraded-but-valid outcomes.
If fail: Audit existing warning() calls. If the function returns nonsense after the warning, change it to stop().
Step 5: Use Assertions for Internal Invariants
For conditions that "should never happen" in correct code, use assertions. These catch programmer errors during development:
# R: stopifnot for internal invariants
process_chunk <- function(chunk, total_size) {
stopifnot(
is.list(chunk),
length(chunk) > 0,
total_size > 0
)
# ...
}
# R: explicit assertion with context
merge_results <- function(left, right) {
if (ncol(left) != ncol(right)) {
stop("Internal error: column count mismatch (", ncol(left), " vs ", ncol(right),
"). This is a bug — please report it.", call. = FALSE)
}
# ...
}
Got: Internal invariants are asserted so bugs surface immediately at the violation site, not three function calls later with a cryptic error.
If fail: If stopifnot() messages are too cryptic, switch to explicit if/stop with context.
Step 6: Refactor Anti-Patterns
Identify and fix these common anti-patterns:
Anti-pattern 1: Empty tryCatch (swallowing errors)
# BEFORE: Error silently disappears
result <- tryCatch(
parse_data(input),
error = function(e) NULL
)
# AFTER: Log, re-throw, or return a typed error
result <- tryCatch(
parse_data(input),
error = function(e) {
cli::cli_abort("Failed to parse input: {e$message}", parent = e)
}
)
Anti-pattern 2: Default values masking bad input
# BEFORE: Caller never knows their input was ignored
process <- function(x = 10) {
if (!is.numeric(x)) x <- 10 # silently replaces bad input
x * 2
}
# AFTER: Tell the caller about the problem
process <- function(x = 10) {
if (!is.numeric(x)) {
stop("'x' must be numeric, got ", class(x)[[1]], call. = FALSE)
}
x * 2
}
Anti-pattern 3: suppressWarnings as a fix
# BEFORE: Hiding the symptom instead of fixing the cause
result <- suppressWarnings(as.numeric(user_input))
# AFTER: Validate explicitly, handle the expected case
if (!grepl("^-?\\d+\\.?\\d*$", user_input)) {
stop("Expected a number, got: '", user_input, "'", call. = FALSE)
}
result <- as.numeric(user_input)
Anti-pattern 4: Catch-all exception handlers
# BEFORE: Every error treated the same
tryCatch(
complex_operation(),
error = function(e) message("Something went wrong")
)
# AFTER: Handle specific conditions, let unexpected ones propagate
tryCatch(
complex_operation(),
custom_validation_error = function(e) {
cli::cli_warn("Validation issue: {e$message}")
fallback_value
}
# Unexpected errors propagate naturally
)
Got: Anti-patterns are replaced with explicit validation or specific error handling.
If fail: If removing a tryCatch causes cascading failures, the upstream code has a validation gap. Fix the source, not the symptom.
Step 7: Validate the Fail-Early Refactoring
Run the test suite to confirm error paths work correctly:
# Verify error messages are triggered
testthat::expect_error(calculate_summary("not_a_df"), "must be a data frame")
testthat::expect_error(calculate_summary(data.frame()), "at least one row")
testthat::expect_error(calculate_summary(mtcars, trim_pct = 2), "between 0 and 0.5")
# Verify valid inputs still work
testthat::expect_no_error(calculate_summary(mtcars, method = "mean"))
# Run full test suite
Rscript -e "devtools::test()"
Got: All tests pass. Error-path tests confirm that bad input triggers the expected error message.
If fail: If existing tests relied on silent failures (e.g., returning NULL on bad input), update them to expect the new error.
Validation
- Every public function validates its inputs before doing work
- Error messages answer: what failed, where, why, and how to fix
-
stop()is used for conditions that produce incorrect results -
warning()is used only for degraded-but-valid outcomes - No empty
tryCatchblocks that swallow errors silently - No
suppressWarnings()used as a substitute for proper validation - No default values that silently mask invalid input
- Internal invariants use
stopifnot()or explicit assertions - Error-path tests exist for each validation guard
- Test suite passes after refactoring
Pitfalls
- Validating too deep: Validate at trust boundaries (public API), not in every internal helper. Over-validation adds noise and hurts performance.
- Error messages without context:
"Invalid input"forces the caller to guess. Always include the parameter name, the expected type/range, and the actual value received. - Using warning() when you mean stop(): If the function returns garbage after the warning, the caller gets a wrong answer silently. Use
stop()and let the caller decide how to handle it. - Swallowing errors in tryCatch:
tryCatch(..., error = function(e) NULL)hides bugs. If you must catch, log or re-throw with added context. - Forgetting call. = FALSE: In R,
stop("msg")includes the call by default, which is noisy for end users. Usecall. = FALSEin user-facing functions.cli::cli_abort()does this automatically. - Validating in tests instead of code: Tests verify behavior but do not protect production callers. Validation belongs in the function itself.
- Wrong R binary on hybrid systems: On WSL or Docker,
Rscriptmay resolve to a cross-platform wrapper instead of native R. Check withwhich Rscript && Rscript --version. Prefer the native R binary (e.g.,/usr/local/bin/Rscripton Linux/WSL) for reliability. See Setting Up Your Environment for R path configuration.
Related Skills
write-testthat-tests- write tests that verify error pathsreview-pull-request- review code for missing validation and silent failuresreview-software-architecture- assess error-handling strategy at the system levelcreate-skill- create new skills following the agentskills.io standardsecurity-audit-codebase- security-focused review that overlaps with input validation
GitHub 저장소
연관 스킬
evaluating-llms-harness
테스팅이 Claude Skill은 MMLU, GSM8K를 포함한 60개 이상의 표준화된 학술 과제에서 LLM 성능을 벤치마크하기 위해 lm-evaluation-harness를 실행합니다. 개발자들이 모델 품질을 비교하고, 학습 진행 상황을 추적하거나 학술 결과를 보고할 수 있도록 설계되었습니다. 이 도구는 HuggingFace와 vLLM 모델을 포함한 다양한 백엔드를 지원합니다.
cloudflare-cron-triggers
테스팅이 스킬은 cron 표현식을 사용하여 Worker를 스케줄링하기 위한 Cloudflare Cron Triggers 구현에 관한 포괄적인 지식을 제공합니다. 주기적 작업, 유지보수 작업, 자동화된 워크플로우 설정 방법을 다루며, 잘못된 cron 표현식이나 시간대 문제 같은 일반적인 이슈들을 해결하는 방법을 포함합니다. 개발자들은 이를 통해 스케줄된 핸들러 구성, cron 트리거 테스트, Workflows 및 Green Compute와의 연동 작업을 수행할 수 있습니다.
webapp-testing
테스팅이 Claude Skill은 Python 스크립트를 통해 로컬 웹 애플리케이션을 테스트하기 위한 Playwright 기반 툴킷을 제공합니다. 프론트엔드 검증, UI 디버깅, 스크린샷 캡처, 로그 확인 기능을 지원하며 서버 라이프사이클을 관리합니다. 브라우저 자동화 작업에 사용하되 컨텍스트 오염을 방지하기 위해 소스 코드를 읽지 않고 스크립트를 직접 실행하세요.
finishing-a-development-branch
테스팅이 스킬은 테스트 통과를 확인한 후 체계적인 통합 옵션을 제시하여 개발자가 완성된 작업을 마무리하도록 돕습니다. 구현이 완료된 후 머지, PR 생성, 브랜치 정리와 같은 워크플로우를 안내합니다. 코드가 준비되고 테스트가 완료되었을 때 개발 프로세스를 체계적으로 마무리하기 위해 사용하세요.
