Overview
Probably is built on a local-first architecture that keeps your data on your devices while leveraging AI capabilities through secure, encrypted connections.
Privacy-First Design
Your data never leaves your control. Probably processes everything locally while only sending encrypted queries to AI providers when needed.
What is Local-First?
Section titled “What is Local-First?”Core Principles
Section titled “Core Principles”Local-first software prioritizes user agency and data ownership:
- Data Ownership: Your data belongs to you and stays on your devices
- Privacy by Design: No surveillance, tracking, or data mining
- Offline Capability: Core functionality works without internet connection
- Performance: Local processing is faster than cloud alternatives
- Reliability: No dependence on external services for basic operations
Probably’s Implementation
Section titled “Probably’s Implementation”Probably implements local-first principles through:
Local Data Processing
- All data analysis happens on your machine
- Database connections are direct from your computer
- No data passes through external servers
- Full control over what data is processed
Selective AI Integration
- AI providers only receive specific queries you authorize
- Sample data and chart information sent only when explicitly using AI features
- Responses are processed locally
- All AI interactions are encrypted and logged
Transparent Operations
- Clear visibility into what data is accessed
- Audit trail of all AI interactions
- User control over every external connection
- Open architecture with no hidden data flows
Benefits of Local-First
Section titled “Benefits of Local-First”Data Privacy and Security
Section titled “Data Privacy and Security”Complete Data Control
- Your sensitive business data never leaves your premises
- No risk of data breaches at cloud providers
- Compliance with strict data privacy regulations
- Protection against surveillance and data mining
Encrypted External Communication
- All AI provider communications use encryption
- API keys stored securely on your device
- No persistent data storage on external servers
- Zero-knowledge architecture for AI interactions
Performance Advantages
Section titled “Performance Advantages”Local Processing Speed
- Database queries execute at native speed
- No network latency for data operations
- Immediate response for most analytical tasks
- Efficient use of local computing resources
Reduced Bandwidth Usage
- Only essential AI queries use internet
- No large data transfers to cloud services
- Efficient caching of AI responses
- Minimized network dependencies
Reliability and Availability
Section titled “Reliability and Availability”Offline Capability
- Core analysis functions work without internet
- Local data sources always accessible
- Cached AI responses available offline
- No downtime due to external service issues
Service Independence
- Not dependent on any single cloud provider
- Multiple AI provider support reduces vendor lock-in
- Local backup and restore capabilities
- User control over service dependencies
How Local-First Works in Probably
Section titled “How Local-First Works in Probably”Data Flow Architecture
Section titled “Data Flow Architecture”Your Data Sources → Local Processing → AI Queries (when needed) → Local Results ↓ ↓ ↓ ↓[Databases] [Your Computer] [Encrypted APIs] [Your Interface][Files] [Analysis Engine] [AI Providers] [Visualizations][Spreadsheets] [PXL Runtime] [Secure Channels] [Export Options]Local Components
Section titled “Local Components”Data Processing Engine
- Runs entirely on your machine
- Handles all data transformations
- Executes PXL expressions locally
- Manages visualization generation
Security Layer
- Encrypts all external communications
- Manages API key storage
- Implements access controls
- Provides audit logging
Caching System
- Stores AI responses locally
- Reduces external API calls
- Improves performance
- Respects data privacy
External Interactions
Section titled “External Interactions”AI Provider Queries
- Only send specific analysis requests
- Never include raw sensitive data
- Use encrypted communication channels
- Receive only relevant AI responses
Database Connections
- Direct connections from your machine
- No data routed through external services
- Standard database security protocols
- Your existing access controls apply
Privacy Guarantees
Section titled “Privacy Guarantees”What Stays Local
Section titled “What Stays Local”All Your Data
- Source databases and files
- Analysis results and insights
- Visualizations and reports
- User configurations and preferences
Processing Operations
- Data cleaning and transformation
- Statistical calculations
- Aggregation and filtering
- Visualization generation
Sensitive Information
- Customer data and PII
- Financial information
- Proprietary business data
- Internal analytics and metrics
What Uses External Services
Section titled “What Uses External Services”AI Analysis Requests
- Natural language questions and analysis requests
- Variable metadata (names, types, sample values)
- Chart data and visualizations for analysis
- Complete column data when using semantic functions (extract, classify, score)
Always with Your Control
- Explicit user authorization required
- Clear disclosure of what data is shared
- Option to disable external AI entirely
- Full audit trail of all interactions
Compliance and Regulations
Section titled “Compliance and Regulations”Data Privacy Standards
Section titled “Data Privacy Standards”GDPR Compliance
- Data minimization by design
- User control over data processing
- Right to data portability
- Clear consent mechanisms
Industry Standards
- SOC 2 Type II compatible architecture
- HIPAA-friendly data handling
- PCI DSS compliant payment data protection
- ISO 27001 security framework alignment
Audit and Governance
Section titled “Audit and Governance”Transparency Features
- Complete audit logs of data access
- Clear documentation of data flows
- User-controlled privacy settings
- Regular security updates and patches
Data Governance
- User-defined data retention policies
- Configurable access controls
- Backup and recovery under user control
- No vendor lock-in for data export
Comparison with Cloud Solutions
Section titled “Comparison with Cloud Solutions”Traditional Cloud Analytics
Section titled “Traditional Cloud Analytics”❌ Cloud-First Limitations
- Data stored on external servers
- Privacy and security risks
- Network latency affects performance
- Vendor lock-in concerns
- Compliance complexity
- Ongoing subscription costs
✅ Local-First Advantages
- Data stays under your control
- Maximum privacy and security
- Native performance speed
- No vendor dependencies
- Simplified compliance
- One-time licensing model
Best of Both Worlds
Section titled “Best of Both Worlds”Probably combines local-first benefits with AI capabilities:
- Local processing for all data operations
- Selective AI integration for advanced analysis
- User control over external service usage
- Transparent operations with full audit trails
Implementation Considerations
Section titled “Implementation Considerations”System Requirements
Section titled “System Requirements”Hardware Specifications
- Sufficient RAM for dataset processing
- Local storage for data and cache
- Network connectivity for AI services
- Standard business computer specifications
Software Dependencies
- Supported operating systems
- Database drivers as needed
- Network security compliance
- Standard business software stack
Deployment Models
Section titled “Deployment Models”Individual Users
- Desktop application installation
- Personal data source connections
- Individual AI provider accounts
- Local configuration management
Team and Enterprise
- Shared data source configurations
- Centralized license management
- Team collaboration features
- Enterprise security integration
Hybrid Approaches
- Mix of local and shared data sources
- Configurable AI service usage
- Flexible deployment options
- Scalable architecture design
What’s Next?
Section titled “What’s Next?”Architecture Details
Learn about the technical architecture that makes local-first possible.
Performance Benefits
Explore how local-first architecture delivers superior performance.