AskNews developed a news analysis platform that processes 500,000 articles daily across multiple languages, using LLMs to extract facts, analyze bias, and identify contradictions between sources. The system employs edge computing with open-source models like Llama for cost-effective processing, builds knowledge graphs for complex querying, and provides programmatic APIs for automated news analysis. The platform helps users understand global perspectives on news topics while maintaining journalistic standards and transparency.
AskNews represents an innovative approach to deploying LLMs in production for news analysis and bias detection. The case study demonstrates several important aspects of LLMOps implementation in a real-world media context.
## System Architecture and Technical Implementation
The platform employs a sophisticated multi-layer architecture that processes news content through several stages:
At the edge layer, the system uses open-source LLMs (primarily Llama 2 and Llama 3.1) to perform initial content analysis directly at the data source. This approach helps manage costs that would be prohibitive using cloud-based services like OpenAI's models. The edge processing involves:
* Extracting key facts, people, and locations from articles
* Identifying attribution and allegations
* Assessing reporting voice and style
* Generating article metadata including geolocation and source origin
The system processes approximately 500,000 articles per day, creating embeddings for each piece of content. These embeddings enable efficient clustering of related news items and semantic similarity comparisons. The platform builds a comprehensive knowledge graph that captures relationships between entities (people, organizations, locations) mentioned in the news.
## Data Processing and Analysis Pipeline
The system implements several sophisticated processing steps:
* Automatic language detection and translation capabilities
* Clustering of related articles across languages and sources
* Extraction of contradictions and alignments between sources
* Building of topic-specific knowledge graphs
* Implementation of programmatic APIs for automated analysis
## Bias Detection and Diversity Enforcement
A key innovation in the platform is its approach to bias detection and source diversity. Unlike systems that pre-assign political leanings to sources, AskNews analyzes each article independently, allowing for more nuanced understanding of perspectives. The system:
* Enforces geographical and linguistic diversity in source selection
* Identifies contradictions between sources without making judgments
* Maintains transparency about source origins and attribution
* Provides clear citation trails back to original sources
## Production Deployment Considerations
The team has made several important operational decisions:
* Using open-source models for transparency and cost management
* Implementing edge computing to reduce latency and costs
* Building a tiered service model with both free and licensed content
* Providing programmatic APIs for integration with other systems
* Maintaining a transparency dashboard for system operations
## Scalability and Performance
The system demonstrates robust scalability through:
* Processing 500,000 articles daily across multiple languages
* Supporting real-time analysis and alerts
* Managing large-scale knowledge graphs
* Handling multiple concurrent user queries
* Processing content in multiple languages (approximately 60% non-English)
## Integration and Use Cases
The platform has found diverse applications:
* Journalists using it for fact-checking and research
* Academic institutions (like UT Austin) using it for misinformation research
* Security firms using it for global risk analysis
* NGOs using it for situation monitoring and alerts
## Challenges and Solutions
The team has addressed several key challenges:
* Managing costs through strategic use of open-source models
* Ensuring accuracy in source attribution and fact extraction
* Maintaining system performance at scale
* Handling copyright and licensing issues
* Building trust through transparency
## Future Development
The platform continues to evolve with planned features including:
* Natural language alerts for specific topics or regions
* Enhanced sharing capabilities for collaborative use
* Integration with messaging platforms (WhatsApp, Telegram)
* Improved customization options for different use cases
## Technical Debt and Maintenance
The team maintains the system through:
* Regular updates to underlying models (e.g., transitioning from Llama 2 to 3.1)
* Monitoring and updating the knowledge graph
* Maintaining API compatibility
* Ensuring data quality and source diversity
The case study demonstrates a sophisticated approach to deploying LLMs in production for news analysis, showing how careful architecture choices, strategic use of open-source models, and attention to scalability can create a powerful tool for news analysis while managing costs and maintaining transparency.
Start your new ML Project today with ZenML Pro
Join 1,000s of members already deploying models with ZenML.