Manual Fleet Scaling
Control your worker capacity with real-time scaling, specialized cohorts, budget protection, and enhanced fleet visibility with diagnostic transparency.
Manual Fleet Scaling
Vanio AI's manual fleet scaling lets you instantly adjust your worker capacity up or down based on real-time demand. Whether you're preparing for a busy checkout weekend or scaling back during quiet seasons, you can add or remove workers with built-in budget protection and smart selection strategies. You can also organize workers into specialized groups (cohorts) for different types of operations like pricing updates or search optimization.
What it does
Manual fleet scaling gives you direct control over how many workers handle your vacation rental operations. The system monitors queue depth, worker load, and processing pressure in real-time, then lets you make informed scaling decisions with safety guardrails like budget caps and intelligent worker selection. Workers can be organized into cohorts that specialize in specific tasks, allowing you to scale different types of processing capacity independently.
Getting started
Scaling up workers
- Go to Fleet Management → Scaling in your dashboard
- Click Scale Up in the top toolbar
- Choose your provider and region from the dropdowns
- Set the number of workers to add (1-50 workers per batch)
- Optional: Select a worker cohort to target specific operations:
- General: Handles all types of tasks (default)
- Pricing: Specialized for rate calculations and revenue optimization
- Search: Optimized for property search and availability queries
- Custom: Your own specialized cohort for specific business needs
- Optional: Set a monthly budget cap to prevent overspending
- Click Preview Changes to see cost estimates and capacity impact
- Review the summary and click Scale Up to confirm
[Screenshot: Scale up dialog showing provider selection, worker count slider, cohort dropdown, and budget cap field]
Scaling down workers
- In Fleet Management → Scaling, click Scale Down
- Optional: Filter by cohort if you only want to scale down specific worker types
- Choose your scaling strategy:
- Lowest Activity: Removes workers with the least current workload (recommended)
- Newest First: Removes recently added workers, keeping experienced ones
- Oldest First: Removes longest-running workers for fleet refresh
- Set the number of workers to remove
- Click Preview Changes to see which specific workers will be affected
- Review the list and click Scale Down to confirm
[Screenshot: Scale down dialog showing cohort filter, strategy options and worker preview list]
How it works
Real-time monitoring
Vanio continuously tracks your fleet performance across three key metrics:
- Queue depth: How many tasks are waiting to be processed
- Worker load: Current capacity utilization per worker
- System pressure: Overall processing demand and response times
These insights appear in your scaling dashboard to help you make informed decisions.
Priority processing for paying customers
Paying customers now benefit from a fast-track processing system that significantly reduces data refresh times. Your property calendars, pricing, and listing details update much more frequently throughout the day, especially during busy booking seasons when prices change rapidly. This priority system ensures your listings stay current and competitive without waiting behind thousands of other processing jobs.
The system automatically routes your property updates through dedicated high-priority channels while maintaining the standard processing flow for comprehensive coverage. This dual-queue approach means you get both speed and reliability for your critical property data.
Complete infrastructure visibility
The fleet coverage map now displays your complete infrastructure across all cloud providers, including DigitalOcean and workers from any region. Previously, some workers could be missing from the overview if they were deployed in unmapped locations. Now you see the full picture of your fleet distribution, making it easier to identify scaling opportunities and regional capacity gaps. This improved visibility helps you make better decisions about where to add or remove workers based on your actual geographic coverage.
Improved fleet accuracy and diagnostics
The Fleet dashboard now provides more reliable worker counts by including all active workers in the totals, even those from regions that don't appear on the map. This ensures your scaling decisions are based on complete and accurate capacity information, eliminating previous undercounting issues that could make your fleet appear smaller than it actually was.
When you view the Fleet dashboard, you'll now see exactly which data source is being used to count your workers (PostgreSQL database or Redis cache). This diagnostic information helps administrators quickly spot any configuration issues that might affect worker visibility or scaling operations. If you notice inconsistencies in your worker counts, this transparency makes troubleshooting much faster.
Worker cohorts and specialized processing
Cohorts let you organize workers into specialized groups that handle different types of operations. When you create workers in a specific cohort, they automatically connect to the appropriate task queues for their specialization. This means you can scale your pricing optimization capacity independently from your search processing power, giving you granular control over different aspects of your property management operations.
Workers without a cohort assignment join the "general" group and handle all types of tasks, maintaining backward compatibility with existing fleet configurations.
Intelligent worker selection
When scaling down, Vanio automatically identifies the best candidates for removal based on your chosen strategy. The Lowest Activity strategy analyzes the last 30 minutes of worker performance to find truly idle workers, minimizing disruption to active operations. When you filter by cohort, the selection only considers workers from that specific group.
Budget protection
Set monthly spending limits to prevent accidental overspend. The system calculates your current run-rate across all active workers and blocks scaling requests that would exceed your cap. Budget checks happen before any new workers are created, so rejected requests don't incur charges.
Provider-specific optimization
Vanio handles the technical differences between cloud providers automatically:
- Cost-optimized providers: Workers are fully removed when scaling down, stopping hourly charges immediately
- Prepaid providers: Workers are paused instead of destroyed to avoid double-billing, then reactivated when you scale back up
- Native platforms: Some providers handle scaling through their own systems and aren't available for manual adjustment
Key features
• Instant capacity changes: New workers join your fleet within 2-3 minutes • Priority processing: Paying customers get faster property data updates through dedicated high-speed channels • Complete provider coverage: Full visibility across all cloud providers including DigitalOcean • Accurate fleet totals: Reliable worker counts including workers from unmapped regions • Data source transparency: See exactly which system is providing your worker counts for faster troubleshooting • Specialized worker cohorts: Organize workers by operation type for targeted scaling • Budget guardrails: Set spending limits to prevent unexpected charges • Smart worker selection: Choose removal strategy based on activity, age, or rotation needs • Cost transparency: See exact monthly cost impact before confirming changes • Dry run mode: Preview scaling changes without actually modifying your fleet • Automatic safety checks: System prevents removal of critical workers or over-provisioning • Real-time insights: Live metrics help you scale at the right time • Cohort-specific monitoring: Track performance and capacity by worker specialization • Geographic coverage map: See your complete infrastructure distribution across all regions
Tips & best practices
When to scale up
- Peak seasons: Add workers 24-48 hours before expected booking surges
- Marketing campaigns: Scale proactively when running promotional campaigns
- Queue alerts: If you see consistent backlogs in your task queue
- Response time degradation: When guest communications or booking confirmations slow down
- Operation-specific bottlenecks: Scale specific cohorts when you notice delays in pricing updates or search performance
When to scale down
- Off-season periods: Reduce capacity during predictably quiet months
- Post-campaign: Remove temporary workers added for marketing pushes
- Over-provisioning: If workers consistently show low activity percentages
- Cost optimization: Regular fleet reviews to match capacity with actual demand
- Cohort rebalancing: Scale down over-provisioned specializations while maintaining others
Maximizing priority processing benefits
- Monitor data freshness: Paying customers should notice significantly faster calendar and pricing updates throughout the day
- Peak season advantage: The priority system becomes most valuable during busy booking periods when standard processing can experience delays
- Real-time pricing: Take advantage of faster pricing updates to adjust rates more dynamically based on market conditions
Using the infrastructure overview
- Check the fleet coverage map regularly to understand your geographic distribution
- Look for gaps in coverage that might affect guest experience in specific regions
- Use the complete provider overview to make informed decisions about where to scale next
- The "regions pending geo" diagnostic helps you track workers that need region assignment
- Pay attention to the data source indicator to ensure your worker counts are accurate and up-to-date
Using worker cohorts effectively
- Start with General workers for basic operations, then add specialized cohorts as your needs grow
- Use Pricing cohorts during rate update seasons or competitive analysis periods
- Scale Search cohorts before marketing campaigns that will drive increased property browsing
- Create Custom cohorts for unique business processes specific to your operation
- Monitor cohort performance separately to identify which operations need the most capacity
Choosing scaling strategies
- Use Lowest Activity for routine capacity adjustments - it's the safest option
- Use Newest First immediately after adding workers you want to partially reverse
- Use Oldest First monthly or quarterly to refresh your fleet and maintain performance
- Always preview changes in dry run mode before large scaling operations
- When scaling down by cohort, ensure you maintain minimum capacity for critical operations
Budget management
- Set your monthly cap 10-15% above your target spend to allow for necessary emergency scaling
- Review and adjust budget caps seasonally as your business grows
- Monitor the cost estimates carefully - worker pricing varies significantly between providers and regions
- Factor in cohort specialization when budgeting - some operations may need more consistent capacity than others
Monitoring fleet health
- Check your fleet status dashboard regularly to track capacity and performance trends
- Use the complete infrastructure overview to identify scaling opportunities across all providers
- Trust the improved accuracy in worker counts for better scaling decisions
- If you notice worker count discrepancies, check which data source is active and contact support if needed
Common questions
How quickly do scaling changes take effect? New workers typically join your fleet and start processing tasks within 2-3 minutes. Scaling down is immediate - workers stop handling new tasks right away and finish their current work before shutting down.
What happens to in-progress tasks when I scale down? Workers complete their current tasks before shutting down, so nothing is lost. New tasks automatically route to remaining workers. The system never interrupts active guest communications or booking processes.
How does the priority processing work for paying customers? Paying customers' property updates automatically route through dedicated high-priority processing channels that skip ahead of standard job queues. This means your calendar availability, pricing changes, and listing details refresh much faster throughout the day, keeping your properties competitive during peak booking times.
Can I scale different providers and cohorts independently? Yes, each provider, region, and cohort combination scales independently. You might run 5 general workers and 3 pricing workers in US East, while maintaining different configurations in Europe. Use the provider, region, and cohort filters to target specific parts of your fleet.
What happens if I don't specify a cohort when scaling up? Workers without a cohort assignment automatically join the "general" group and handle all types of tasks. This maintains compatibility with existing scaling workflows and ensures your workers can process any operation type.
Why does my infrastructure overview now show more workers? We've improved the fleet coverage map to include all cloud providers and workers from every region. Previously, some workers (like those from DigitalOcean or unmapped regions) weren't displayed in the overview. Now you see your complete infrastructure, which may show higher worker counts than before - this reflects the true size of your fleet rather than a partial view.
What are "regions pending geo" in the diagnostic banner? These are active workers that haven't been assigned to a specific geographic region yet. They're fully functional and processing your tasks, but don't appear as pins on the coverage map. The system includes them in your total worker counts to give you accurate capacity information.
What does the data source indicator mean in my Fleet dashboard? This shows whether your worker counts are coming from the main database (PostgreSQL) or the cache system (Redis). Both sources should show the same numbers, but if you notice discrepancies or missing workers, this diagnostic helps administrators quickly identify and fix configuration issues. You don't need to take any action - it's just transparency to help with troubleshooting if needed.