Skip to main content
AboutProblemsGuidelinesTerms & conditions

Problem Statement 2

"Kisan-Query-Hub" - Smart Agricultural Knowledge & Early Warning System

Focus: Context-Aware NLP | Knowledge Graph Construction | Predictive Analytics | Real-time Data Fusion

1. Background:

The Kisan Call Center (KCC) has 20+ million farmer queries in 22 Indian languages—a goldmine for agricultural AI. However, 50-60% of data is unusable, and even after basic cleaning (which your team has already solved), three critical problems remain unsolved.

2. Current Problem:

1. Zero Context Problem

  • Queries exist in isolation—no link to weather, crop cycles, soil data, or market prices
  • Example: "Leaf curl disease" queries spike, but nobody knows it's linked to unusual rainfall until weeks later
  • Impact: Can't predict or prevent agricultural crises

2. Knowledge Trapped in Text

  • Expert answers are unstructured paragraphs, not reusable knowledge
  • Same problem described 50 different ways across languages isn't recognized as the same issue
  • Impact: AI can't learn patterns; every query treated as unique

3. No Early Warning System

  • Data shows what happened yesterday, not what's coming next week
  • Patterns that could predict pest outbreaks or crop failures 2-4 weeks early are invisible
  • Impact: Government and farmers always react too late

3. Goal:

Build a system that transforms cleaned KCC data into predictive agricultural intelligence.

Go beyond cleaning → Create a platform that:

  1. Enriches queries with real-world context (weather, soil, prices, crop stage)
  2. Extracts structured knowledge from unstructured expert responses
  3. Predicts agricultural crises before they happen
  4. Visualizes insights through an intelligent dashboard

4. Expected Solution (MVP Requirements):

1. Context Enrichment Engine

Auto-tag each query with:

  • Weather context: Rainfall, temperature from IMD API
  • Crop calendar: Current growth stage (sowing/flowering/harvest)
  • Local conditions: Soil health, groundwater levels, market prices (Agmarknet API)
  • Active alerts: Government advisories, pest warnings

Output: Each query becomes (original_query + translated_query + context_metadata)

2. Knowledge Graph Builder

Extract and structure agricultural knowledge:

Entity Recognition: Identify crops, diseases, pests, treatments, locations

Relationship Mapping: Build connections like:

  • (Rice) → [AFFECTED_BY] → (Blast Disease) → [TREATED_WITH] → (Tricyclazole)

Multi-lingual Unification:

  • Merge concepts: "पत्ती मोड़क", "இலை சுருட்டு", "Leaf Curl" → One entity
  • Build cross-language agricultural terminology map

Knowledge Extraction from Expert Responses:

  • Extract: Treatment methods, dosages, timings, success rates
  • Structure into queryable format

Demo Goal: Answer questions like:

  • "What organic treatments work for fungal diseases in Tamil Nadu?"
  • "Show all queries about tomato problems during monsoon"
3. Predictive Analytics Dashboard

Build 3 core prediction models:

a) Anomaly Detection

  • Flag sudden spikes in query types (30%+ increase)
  • Identify geographic clusters of similar problems
  • Alert: "Unusual surge in pest queries in 5 Gujarat districts"

b) Crisis Forecasting

  • Predict outbreaks 2-4 weeks ahead using:
    • Query patterns + weather data + historical trends
  • Output: District-level risk scores (0-100)

c) Smart Recommendations

  • Given farmer's query + context → Suggest preventive actions
  • Use knowledge graph (not generic AI responses)

Validation: Test on historical data—can you predict a real 2023/2024 agricultural crisis using past patterns?

4. Intelligence Dashboard

Real-time visualizations:

  • Crisis Heatmap: Live district alerts with severity colors
  • Knowledge Explorer: Visual graph navigation (click through diseases → crops → treatments)
  • Predictive Timeline: 30-day forecast of emerging issues
  • Impact Metrics:
    • Queries enriched with context
    • Knowledge entities extracted
    • Accuracy of predictions vs actual events

5. "Level Up" Features:

Advanced Features (Choose 2-3)

  • Voice Integration: Process audio call recordings → transcribe → extract knowledge
  • Misinformation Detector: Flag responses contradicting scientific consensus
  • Explainable AI: Show reasoning behind predictions ("Based on 347 similar cases...")
  • Intervention Simulator: Model impact of deploying advisories now vs later
  • Multi-lingual Semantic Search: Query in any Indian language, get relevant knowledge

6. Tech Stack:

Core

  • Backend: Node.js/Express + Python/FastAPI (for ML)
  • Database: MongoDB (queries) + Neo4j (knowledge graph)
  • Queue: Redis (background processing)

AI/ML

  • Translation: IndicTrans2 or Google Translate API
  • NLP: spaCy (entity extraction), LangChain (LLM orchestration)
  • Models: Scikit-learn/Prophet/SARIMAX (forecasting), HDBSCAN (clustering)

Frontend

  • Framework: React.js
  • Visualization: D3.js (graph), Recharts (charts), Deck.gl (maps)

External APIs

  • data.gov.in (KCC data), OpenWeatherMap (weather), Agmarknet (prices), ICAR (advisories)