Troubleshooting
This page is a quick FAQ for common Floability deployment issues.
FAQ
Manager and workers do not connect
Use this checklist:
- Check whether your manager is visible in the catalog: https://catalog.cse.nd.edu/
- Check TaskVine status tools: https://ccl.cse.nd.edu/software/taskvine/status/
- Verify manager port range policy for your cluster.
In many environments, the root cause is socket/port policy. Some sites only allow specific manager port ranges. See site pages for known working ranges:
Quick isolation test:
Try the TaskVine quickstart directly (without Floability): https://cctools.readthedocs.io/en/stable/taskvine/#quick-start
Interpretation:
- If TaskVine quickstart also fails, the issue is likely network/scheduler/site policy.
- If TaskVine quickstart works but Floability fails, network is likely fine and the issue may be Floability configuration.
In that case, open a Floability issue with command, logs, cluster, and error details: https://github.com/floability/floability-cli/issues
SSH -L tunneling does not work
Common fixes:
- Use the domain name you SSH into, not the raw IP printed by Jupyter.
- Ensure local port is free (use another local port if needed).
- Confirm you are tunneling to the same login node where Jupyter is running.
- Check VPN/firewall/jump-host requirements at your site.
Examples:
# Standard tunnel
ssh -L 8888:localhost:8888 <username>@<cluster-login-domain>
# If local 8888 is busy
ssh -L 8899:localhost:8888 <username>@<cluster-login-domain>
Then open the local URL with the token from terminal output.
Workers are not starting
Most often this is scheduler configuration.
What to check:
- Verify your scheduler access and allocation/permissions.
- Set required queue/partition/runtime options via
--batch-options. - Confirm your batch type matches site scheduler (
condor,uge, orslurm).
Example:
floability run --backpack <backpack-root> --batch-type slurm \
--batch-options "-p wide -t 02:00:00"
Use your cluster documentation for exact queue/partition/account flags. See site-specific deployment pages in Deployment Overview.