Inquiry on Test Method for Beacon Tx Power Variation vs Golden Device

Dear Apple team,

We would like clarification on the test method for the requirement that the beacon-to-beacon average transmit power variation shall stay within ±5 dB of the golden device. Could you kindly clarify the following points?

1. How is the golden device defined?
2. Should Tx power be measured conducted or radiated?
3. What is the averaging rule over the advertising channels?
4. What test sample quantity is required?
5. What environmental test conditions are required?
6. Does the ±5 dB tolerance apply to each individual beacon's average transmit power?
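To illustrate why we are asking about the averaging rule: averaging per-channel readings directly in dBm and averaging them in linear milliwatts give different results, which can matter near the ±5 dB limit. The sketch below is only our own illustration. The function names, the three per-channel readings, and the golden-device values are hypothetical and are not taken from the specification.

```python
import math


def dbm_to_mw(dbm):
    """Convert a power level from dBm to milliwatts."""
    return 10 ** (dbm / 10)


def mw_to_dbm(mw):
    """Convert a power level from milliwatts to dBm."""
    return 10 * math.log10(mw)


def average_dbm_linear(readings_dbm):
    """Average in the linear (mW) domain, then convert back to dBm."""
    mean_mw = sum(dbm_to_mw(p) for p in readings_dbm) / len(readings_dbm)
    return mw_to_dbm(mean_mw)


def average_dbm_log(readings_dbm):
    """Average the dBm values directly (arithmetic mean in the log domain)."""
    return sum(readings_dbm) / len(readings_dbm)


def within_tolerance(dut_avg_dbm, golden_avg_dbm, tol_db=5.0):
    """Check whether the device average is within +/- tol_db of the golden device."""
    return abs(dut_avg_dbm - golden_avg_dbm) <= tol_db


# Hypothetical per-channel Tx power readings in dBm (assumed values, three channels).
dut = [-2.0, 0.0, 4.0]
golden = [0.0, 0.0, 0.0]

for name, avg in (("linear", average_dbm_linear), ("log", average_dbm_log)):
    dut_avg = avg(dut)
    golden_avg = avg(golden)
    print(name, round(dut_avg, 2), within_tolerance(dut_avg, golden_avg))
```

With these assumed readings the two averages differ by under 1 dB, but the gap grows as the per-channel spread increases, so we would appreciate confirmation of which rule the requirement intends.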
