Monocular Building Height Estimation from PhiSat-2 Imagery: Dataset and Method
arXiv cs.CV / 4/1/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper evaluates PhiSat-2 optical imagery for monocular building height estimation and addresses prior gaps in systematic assessment for this data source.
- It introduces the PhiSat-2-Height dataset (PHDataset) with 9,475 co-registered image-label patch pairs from 26 cities worldwide.
- The proposed Two-Stream Ordinal Network (TSONet) jointly performs footprint segmentation and height estimation, using modules including a Cross-Stream Exchange Module (CSEM) and Feature-Enhanced Bin Refinement (FEBR).
- Experiments on PHDataset show TSONet delivers best overall performance, lowering MAE and RMSE by 13.2% and 9.7% and improving IoU and F1-score by 14.0% and 10.1% versus the strongest competing approaches.
- Ablation and additional analyses confirm that the method’s footprint-aware feature interaction and ordinal refinement are effective, and that PhiSat-2’s resolution and multispectral bands help mitigate ambiguous height cues in monocular settings.
Related Articles

Black Hat Asia
AI Business

Knowledge Governance For The Agentic Economy.
Dev.to

AI server farms heat up the neighborhood for miles around, paper finds
The Register

Paperclip: Công Cụ Miễn Phí Biến AI Thành Đội Phát Triển Phần Mềm
Dev.to
Does the Claude “leak” actually change anything in practice?
Reddit r/LocalLLaMA