MCP HubMCP Hub
스킬 목록으로 돌아가기

standardize-geo-values

TomGranot
업데이트됨 2 days ago
2 조회
33
11
33
GitHub에서 보기
기타general

정보

이 스킬은 "US"와 "USA"처럼 일관되지 않은 국가 및 주/지역 형식을 표준화하여 연락처와 회사 정보에서 신뢰할 수 있는 지리적 세분화를 보장합니다. 초기 데이터 위생 처리 후 정확한 필터링과 보고를 위해 레코드를 정리할 때 사용됩니다. 개발자는 지역별 캠페인이나 분석을 위해 데이터를 준비할 때 이 스킬을 적용해야 합니다.

빠른 설치

Claude Code

추천
기본
npx skills add TomGranot/hubspot-admin-skills -a claude-code
플러그인 명령대체
/plugin add https://github.com/TomGranot/hubspot-admin-skills
Git 클론대체
git clone https://github.com/TomGranot/hubspot-admin-skills.git ~/.claude/skills/standardize-geo-values

Claude Code에서 이 명령을 복사하여 붙여넣어 스킬을 설치하세요

문서

Standardize Country and State/Region Values

Convert inconsistent geographic formats (e.g., "US", "USA", "U.S." vs "United States"; "NY" vs "New York") to a single standard format across all contact and company records.

Why This Matters

Inconsistent geo values break geographic segmentation. A list filtering for "United States" will miss contacts labeled "US" or "USA". For B2B companies running region-specific campaigns or reporting by geography, this means inaccurate audience sizes and missed contacts.

Prerequisites

  • Phase 1 hygiene processes completed (invalid/deleted contacts removed first)
  • Access to Contacts and Companies views with bulk edit permissions
  • Key constraint: If your CRM integrates with another system (e.g., Salesforce, marketing automation), agree on the standard format (full names vs. ISO codes) with that system's admin BEFORE standardizing. Mismatched formats between synced systems will cause ongoing data conflicts.
  • Decision on standard format. This skill recommends:
    • Countries: Full names (e.g., "United States", "United Kingdom")
    • States (contact level): Full names (e.g., "New York", "California")
    • This matches HubSpot's default form behavior

Interview: Gather Requirements

Before executing, collect the following information from the user:

Q1: What format do you prefer for country values -- full names (United States) or ISO codes (US)?

  • Examples: Full names ("United States", "United Kingdom"), ISO 2-letter codes ("US", "GB"), ISO 3-letter codes ("USA", "GBR")
  • Default: Full names (e.g., "United States", "United Kingdom") -- this matches HubSpot's default form behavior

Q2: Do you have a Salesforce or other CRM integration that requires a specific format?

  • Examples: "Yes, Salesforce uses ISO 2-letter codes", "Yes, our ERP uses full country names", "No integrations to worry about"
  • Default: No integration constraints -- use HubSpot's default full-name format

Plan

  1. Audit all non-standard country and state values (before state)
  2. Build a mapping table of variants to standard values
  3. Batch update via API script or manual bulk edit in HubSpot UI
  4. Prevent future inconsistencies by configuring property types and forms
  5. Verify all values are standardized (after state)

Before State

API Audit Script

import os
from hubspot import HubSpot
from dotenv import load_dotenv

load_dotenv()
api_client = HubSpot(access_token=os.getenv("HUBSPOT_API_TOKEN"))

# Check for common country variants
variants = ["US", "USA", "U.S.", "U.S.A.", "America"]
for variant in variants:
    result = api_client.crm.contacts.search_api.do_search(
        public_object_search_request={
            "filterGroups": [{
                "filters": [{
                    "propertyName": "country",
                    "operator": "EQ",
                    "value": variant
                }]
            }],
            "limit": 0
        }
    )
    if result.total > 0:
        print(f"Contacts with country = '{variant}': {result.total}")

# Repeat for companies
for variant in variants:
    result = api_client.crm.companies.search_api.do_search(
        public_object_search_request={
            "filterGroups": [{
                "filters": [{
                    "propertyName": "country",
                    "operator": "EQ",
                    "value": variant
                }]
            }],
            "limit": 0
        }
    )
    if result.total > 0:
        print(f"Companies with country = '{variant}': {result.total}")

Manual Audit

  1. Go to Contacts > filter by Country/Region > is any of > "US". Note count.
  2. Repeat for "USA", "U.S.", and any other suspected variants.
  3. Repeat at the company level.
  4. For states: filter by State/Region > is any of > "NY", "CA", "TX" to check for abbreviation variants.

Record all variant counts as your baseline.

Execute

Method 1: API Batch Update (Recommended for Large Volumes)

# Pattern: Build mapping table, search for each variant, batch update

COUNTRY_MAPPING = {
    "US": "United States",
    "USA": "United States",
    "U.S.": "United States",
    "U.S.A.": "United States",
    "America": "United States",
    "UK": "United Kingdom",
    "GB": "United Kingdom",
    "Great Britain": "United Kingdom",
    # Add other mappings as discovered in your audit
}

STATE_MAPPING = {
    "NY": "New York",
    "CA": "California",
    "TX": "Texas",
    "FL": "Florida",
    "IL": "Illinois",
    "PA": "Pennsylvania",
    "OH": "Ohio",
    "GA": "Georgia",
    "NC": "North Carolina",
    "NJ": "New Jersey",
    "VA": "Virginia",
    "WA": "Washington",
    "MA": "Massachusetts",
    "AZ": "Arizona",
    "CO": "Colorado",
    "MD": "Maryland",
    "MN": "Minnesota",
    "MO": "Missouri",
    "WI": "Wisconsin",
    "CT": "Connecticut",
    "OR": "Oregon",
    "SC": "South Carolina",
    "LA": "Louisiana",
    # Add all 50 US states + territories as needed
}

# For each mapping:
# 1. Search for contacts with the variant value
# 2. Collect all matching contact IDs (paginate if > 100)
# 3. Batch update using crm.contacts.batch_api.update
# 4. Repeat for companies using crm.companies

API notes:

  • Search API caps at 10,000 results per query. Unlikely to hit this for geo variants, but segment if needed.
  • Batch update accepts up to 100 records per call.
  • Rate limit: 100 requests per 10 seconds.

Method 2: Manual Bulk Edit in HubSpot UI

For each variant:

  1. Go to Contacts > Lists > Create list (static)
  2. Filter: Country/Region > is any of > [variant value]
  3. Save list
  4. Select all contacts in the list
  5. Click Edit > Country/Region > type the standard value > Update
  6. Repeat for each variant

For companies, do the same from the Companies view using filters and bulk edit.

Prevent Future Inconsistencies

After standardizing existing data:

  1. Go to Settings > Properties > Contact properties
  2. Search for Country/Region
  3. Verify it is a Dropdown select field (not free text). If it is free text, consider converting to dropdown with standard country names.
  4. Repeat for State/Region (though state may need values for multiple countries, making a dropdown less practical)
  5. Check all active Forms (Marketing > Forms):
    • Verify country and state fields are dropdown fields, not free text inputs
    • Verify dropdown values match your standardized format
  6. Check any import templates used by the team to ensure they reference standard values

After State

# Re-run the same variant checks from before state
variants = ["US", "USA", "U.S.", "U.S.A.", "America"]
for variant in variants:
    result = api_client.crm.contacts.search_api.do_search(
        public_object_search_request={
            "filterGroups": [{
                "filters": [{
                    "propertyName": "country",
                    "operator": "EQ",
                    "value": variant
                }]
            }],
            "limit": 0
        }
    )
    assert result.total == 0, f"Still have {result.total} contacts with '{variant}'"
    print(f"Contacts with country = '{variant}': {result.total} (should be 0)")

Verification checklist:

  1. All known variant values return 0 contacts and 0 companies
  2. The standard value count should equal the sum of (previous standard count + all variant counts)
  3. Spot-check 10 contacts that were in cleanup lists to verify values are now standardized
  4. For states: filter by common abbreviations (NY, CA, TX) and confirm 0 results
  5. Forms are configured to prevent future inconsistencies

Key Technical Learnings

  • Do not touch blank/empty values in this process. Filling in missing country data is a separate enrichment task. This process only standardizes existing values that are non-empty but non-standard.
  • Coordinate with integrated systems first. If HubSpot syncs with Salesforce or another CRM, mismatched formats cause sync conflicts. Agree on the standard format before changing anything.
  • Company-level state abbreviations may be acceptable. HubSpot's default behavior for company State/Region often uses abbreviations. Decide whether to standardize company states or leave them as-is.
  • Root cause matters as much as cleanup. The variants were likely created by imports, API integrations, or free-text form fields that bypassed dropdowns. Fixing the root cause (forms, import templates, integration mappings) is as important as fixing existing data.
  • Export before editing (optional safety measure). Before bulk editing, export affected contacts/companies with their current values as a backup CSV.
  • Bulk edit limits vary by plan. HubSpot may limit bulk edits to certain batch sizes (100-250 at a time). For large numbers, you may need to repeat the select-all-and-edit process multiple times, or use the API approach instead.

GitHub 저장소

TomGranot/hubspot-admin-skills
경로: skills/standardize-geo-values
0

연관 스킬

llamaguard

기타

LlamaGuard는 폭력 및 혐오 발언 등 6가지 안전 범주에서 LLM 입력과 출력을 조정하기 위한 Meta의 70-80억 파라미터 모델입니다. 94-95% 정확도를 제공하며 vLLM, Hugging Face 또는 Amazon SageMaker를 사용해 배포할 수 있습니다. 이 기술을 사용하여 AI 애플리케이션에 콘텐츠 필터링 및 안전 가드레일을 손쉽게 통합하세요.

스킬 보기

cost-optimization

기타

이 Claude Skill은 리소스 적정화, 태깅 전략, 지출 분석을 통해 개발자들이 클라우드 비용을 최적화할 수 있도록 지원합니다. AWS, Azure, GCP에서 클라우드 비용을 절감하고 비용 거버넌스를 구현하기 위한 프레임워크를 제공합니다. 인프라 비용을 분석하거나, 리소스를 적정화하거나, 예산 제약을 충족해야 할 때 사용하세요.

스킬 보기

quantizing-models-bitsandbytes

기타

이 스킬은 bitsandbytes를 사용하여 LLM을 8비트 또는 4비트 정밀도로 양자화하며, 최소한의 정확도 손실로 50-75%의 메모리 감소를 달성합니다. 제한된 GPU 메모리에서 더 큰 모델을 실행하거나 추론을 가속화하는 데 이상적이며, INT8, NF4, FP4와 같은 형식을 지원합니다. 이 스킬은 HuggingFace Transformers와 통합되어 QLoRA 학습 및 8비트 옵티마이저를 가능하게 합니다.

스킬 보기

dispatching-parallel-agents

기타

이 Claude Skill은 3개 이상의 독립적인 문제를 동시에 조사하고 해결하기 위해 다중 에이전트를 배치합니다. 공유 상태나 의존성 없이 해결 가능한 무관련 장애 시나리오에 맞게 설계되었습니다. 핵심 기능은 병렬 문제 해결로, 각 독립 문제 영역마다 하나의 에이전트를 할당하여 효율성을 극대화합니다.

스킬 보기