Risks of Using AI and Countermeasures: Hallucination, Information Leakage, and Bias

AI Navigate Original / 3/17/2026

💬 OpinionIdeas & Deep AnalysisTools & Practical Usage
共有:

Key Points

  • Hallucination: don't make the AI's answer final—require basis/sources, convert claims into verifiable checklists, lean on RAG, and build human review into the process.
  • Information leakage: "don't put it in, separate it, don't keep it"—concretely list banned inputs, anonymize when necessary, use enterprise settings, and add DLP.
  • Bias: draw a use-case line (humans decide hiring/credit/medical), state neutrality in prompts, and run periodic attribute-swapped test cases.
  • The operational template that works: a one-page guideline plus workflow checkpoints plus short monthly training.

Introduction: AI Is Convenient. But "Careless" Accidents Happen Easily

Generative AI (such as ChatGPT) is active across a wide range, from writing, research, and summarization to coding support. On the other hand, risks such as hallucination (plausible lies), information leakage, and bias directly lead to real trouble depending on how you use it.

In this article, we break down difficult topics as much as possible and bring countermeasures useful in the field down to a "you can do it from today" level.

Risk 1: Hallucination (AI's "Plausible Misinformation")

What kind of problem occurs?

Hallucination is the phenomenon of AI confidently saying wrong things. It is especially likely in the following situations.

  • Latest information (outside the model's training range, very recent news)
  • Specialized domains (law, medicine, finance, security)
  • When a source is required (papers, statistics, systems, terms)
  • Proper nouns (people, companies, product names, clause numbers, model numbers)

Common "accident examples"

  • Presenting nonexistent papers or URLs as "references"
  • Asserting laws or systems while still in old content
  • Misleading by arbitrarily and "plausibly" filling in internal rules

Countermeasure: Make AI a "draft craftsman," not a "responder"

Hallucination countermeasures are, roughly speaking, "do not make the AI's answer the final answer." Specifically, the following work.

1) Ask for the basis (source) as a set

Put "always include the basis" and "write uncertain points as uncertain" into the prompt.

Example: "Answer in the order conclusion then basis then how to verify. Always attach a source URL or primary information (the official document name). For unclear points, don't guess; write 'unknown.'"

2) Have it convert into a "form that is easy to verify"

Rather than having it assert in prose, having it turn things into a checklist of verification items increases safety.

  • Decompose claims into bullets
  • Attach a "needs verification" flag to each claim
  • Have it propose where to verify (official sites, terms, primary sources)

3) Lean toward RAG (internal data search) or citation-based operations

If you use it for work, a mechanism that goes and fetches the correct information source is important rather than dumping everything on the model. For example, RAG (Retrieval-Augmented Generation) searches internal documents and a knowledge base and builds answers using those results as the basis.

Concrete options used include Azure AI Search, Amazon Kendra, Elasticsearch, OpenSearch, or search infrastructure integrated with Notion/Confluence/Google Drive.

4) Build human review in as a "process step"

What is strong in operations is to make review criteria explicit.

  • Numbers, dates, and proper nouns are checked in principle

Sign up to read the full article

Create a free account to access the full content of our original articles.