External Data Source
The External Data Source tool enables your agents to connect to external databases, APIs, and data repositories, providing seamless access to structured and unstructured data from various sources.
This tool has Pro status, meaning it requires a professional subscription and provides advanced features for premium users.
Overview
The External Data Source tool transforms your agents into data-driven powerhouses capable of:Database Connectivity
Connect to SQL and NoSQL databases securely
Real-Time Data Access
Query live data sources for up-to-date information
Multi-Source Integration
Aggregate data from multiple external sources
Secure Connections
Enterprise-grade security for sensitive data access
Supported Data Sources
SQL Databases
- PostgreSQL
- MySQL
- SQL Server
Features:
- Full SQL query support
- JSON/JSONB data types
- Advanced indexing and performance
- ACID compliance
- Enterprise data warehouses
- Analytics and reporting
- Transactional applications
- Complex relational data
NoSQL Databases
- MongoDB
- Redis
- Elasticsearch
Features:
- Document-based storage
- Flexible schema
- Horizontal scaling
- Rich query language
- Content management
- Real-time analytics
- Mobile applications
- IoT data storage
Cloud Data Sources
Amazon RDS
Amazon RDS
Supported Engines: PostgreSQL, MySQL, MariaDB, Oracle, SQL Server
Features: Managed database service, automatic backups, scaling
Authentication: IAM roles, username/password, SSL certificates
Use Cases: Scalable web applications, enterprise workloads
Google Cloud SQL
Google Cloud SQL
Supported Engines: PostgreSQL, MySQL, SQL Server
Features: Fully managed, high availability, automatic updates
Authentication: Cloud IAM, SSL certificates, private connectivity
Use Cases: Google Cloud applications, analytics workloads
Azure SQL Database
Azure SQL Database
Features: Serverless computing, intelligent performance, security
Authentication: Azure Active Directory, SQL authentication
Use Cases: Enterprise applications, SaaS platforms
Snowflake
Snowflake
Features: Cloud data warehouse, elastic scaling, data sharing
Authentication: Username/password, SSO, key-pair authentication
Use Cases: Data analytics, business intelligence, data science
Configuration
The External Data Source tool uses a streamlined configuration process through the PLai Framework interface. Specific connection parameters are configured during setup.
Connection Setup Process
1
Navigate to Tools
Go to the Tools section in your project dashboard
2
Create External Data Source
Click Create Tool and select External Data Source
3
Select Data Source Type
Choose your database or data source type from supported options
4
Configure Connection
Provide connection details including host, credentials, and database name
5
Test Connection
Verify connectivity and authentication with the test button
6
Set Access Permissions
Configure read/write permissions and query limitations
7
Add to Agent
Assign this data source to your agents in agent settings
Security Configuration
SSL/TLS Encryption
All connections use encrypted transport for data security
Credential Management
Secure storage and rotation of database credentials
Access Controls
Fine-grained permissions and query restrictions
Audit Logging
Complete logging of all data access activities
Usage Examples
Customer Data Integration
Sales Analytics Integration
Inventory Management Integration
User Behavior Analytics
Query Capabilities
Advanced SQL Features
- Complex Joins
- Window Functions
- CTEs and Subqueries
NoSQL Query Examples
- MongoDB Aggregation
- Elasticsearch Search
Performance Optimization
Query Optimization
Index Usage
Index Usage
Best Practices:
- Use appropriate indexes for frequently queried columns
- Avoid full table scans with proper WHERE clauses
- Use composite indexes for multi-column queries
- Monitor query execution plans
Connection Pooling
Connection Pooling
Benefits:
- Reuse database connections
- Reduce connection overhead
- Better resource utilization
- Improved response times
- Pool size: 10-50 connections
- Connection timeout: 30 seconds
- Idle timeout: 10 minutes
- Max lifetime: 1 hour
Query Caching
Query Caching
Strategies:
- Cache frequently accessed data
- Use appropriate TTL values
- Implement cache invalidation
- Monitor cache hit rates
Monitoring & Analytics
Track data source performance and usage:- Query Performance: Response times and execution plans
- Connection Health: Connection pool status and errors
- Data Volume: Amount of data transferred and processed
- Error Rates: Failed queries and connection issues
- Usage Patterns: Most frequent queries and access patterns
Security & Compliance
Data Protection
Encryption in Transit
All data transfers use TLS 1.3 encryption
Encryption at Rest
Database credentials encrypted in secure storage
Access Auditing
Complete audit trail of all data access
Role-Based Access
Fine-grained permissions per user and agent
Compliance Features
- GDPR Compliance: Data minimization and user rights support
- SOC 2 Type II: Security and availability controls
- HIPAA Support: Healthcare data protection capabilities
- PCI DSS: Payment card industry security standards
- ISO 27001: Information security management
Best Practices
- Use Read-Only Accounts: For data retrieval operations
- Parameterize Queries: Prevent SQL injection attacks
- Monitor Access: Log and monitor all database access
- Regular Audits: Perform security and compliance audits
- Data Classification: Classify data by sensitivity level
Troubleshooting
Common Issues
Connection Failures
Connection Failures
Symptoms: Cannot connect to database
Solutions:
- Verify network connectivity and firewall rules
- Check database server status and availability
- Validate connection string and credentials
- Ensure SSL/TLS configuration is correct
- Test connection from PLai infrastructure
Query Timeouts
Query Timeouts
Symptoms: Queries taking too long or timing out
Solutions:
- Optimize query performance with proper indexes
- Reduce result set size with appropriate filters
- Increase query timeout limits if needed
- Break complex queries into smaller parts
- Monitor database performance metrics
Permission Errors
Permission Errors
Symptoms: Access denied or insufficient privileges
Solutions:
- Verify database user permissions
- Check table and schema access rights
- Ensure proper role assignments
- Review database security policies
- Contact database administrator if needed
Data Format Issues
Data Format Issues
Symptoms: Unexpected data types or formats
Solutions:
- Check data type mappings between systems
- Handle NULL values appropriately
- Validate date and time formats
- Test with sample data first
- Implement proper error handling