Agent-based models (ABMs) are widely used to analyze coupled natural and human systems. Descriptive models require careful calibration with observed data. However, ABMs are often not calibrated in a formal sense. Here we examine the impact of data record size and aggregation on the calibration of an ABM for housing abandonment in the presence of flood risk. Using a perfect model experiment, we examine (i) model calibration and (ii) the ability to distinguish a model with inter-agent interactions from one without. We show how limited data sets may not adequately constrain a model with just four parameters and relatively minimal interactions. We also illustrate how limited data can be insufficient to identify the correct model structure. As a result, many ABM-based inferences and projections rely strongly on prior distributions. This emphasizes the need for utilizing independent lines of evidence to select sound and informative priors.