I would recommend using a model of the Lorentzian curve that more explicitly highlights the meaning of the parameters:
newmodel = area/Pi width/((t - peak)^2 + width^2);
In the model above, the position of the peak is indicated by peak, the curve's width parameter by width, and the area under the curve is given directly by area, as one can confirm below:
assumptions = {width > 0, peak ∈ Reals, area > 0};
(* Area under the curve )
Integrate[newmodel, {t, -Infinity, Infinity}, Assumptions -> assumptions]
( Out: area *)
(* Value at max )
valueAtMax = Simplify[MaxValue[newmodel, t], Assumptions -> assumptions]
( Out: area/(π width) *)
(* full width at half maximum = 2width )
Solve[newmodel == valueAtMax/2, t]
(* Out: {{t -> peak - width}, {t -> peak + width}} *)
With this model, fitting is easier as well:
data = Import["https://pastebin.com/raw/DMhcUZAQ", "CSV"];
result = NonlinearModelFit[data, newmodel, {width, peak, area}, t];
result["BestFitParameters"]
(* Out: {width -> 198.68, peak -> 4959.41, area -> 215.959} *)
Show[
ListPlot[data],
Plot[result[t], {t, 4370, 5600}, PlotStyle -> Red, PlotRange -> Full]
]

The only caveat is the fact that the area and width parameters are somewhat correlated:
TableForm[
result["CorrelationMatrix"],
TableHeadings -> {{"width", "peak", "area"}, {"width", "peak", "area"}}
]

With this, one would be in a better position to optimize the starting values of the parameters thanks to a more immediate grasp of their physical meaning.