Sunday, September 6, 2015





This paper came to my attention because of the extensive press it received.  The concept was also picked up by places like the Atlantic.   I gnawed on it for some time.  Eventually, I contacted the authors.  First about the most egregious problem.  They corrected that in an amendment in ASQ.  I then asked them to address the numerous other problems.  At the end of the explanation, they provide a suggestion for correcting some errors and misrepresentations.  What follows my original review and their response.

According to the Penn State News, this article finds that:

Corporate CEOs who exhibit narcissistic personality traits are more likely to embrace discontinuous or disruptive technologies than their less narcissistic counterparts, according to research by Donald Hambrick of Penn State's Smeal College of Business and colleagues from the University of Erlangen-Nuremberg and IMD International.

Either I do not understand this paper, or the authors are extremely confused, or it is poorly presented, or it is … misreported, or some mix of these.  As written, the model will not run, the corrections presented do not correct, and obsolete practice is used to understand mediation.

Some facts:
  1. The sample is of 33 firms over 28 years (possible 924 observations), but they get 435 or 521 depending on the table.  That is likely because some firms merged etc.
  2.  Their model: “For estimating the degree of adoption, we followed Allison and Waterman’s (2002) approach of using a Poisson specification (with dummies for each firm, to control for firm fixed effects) and then corrected standard errors for overdispersion by multiplying the standard errors by ‘‘the square root of the ratio of the Pearson chi-squared goodness-of-fit statistic to its degrees of freedom’’ (Allison and Waterman, 2002: 257).”
Indications the actual statistical model will not run (the matrix will not invert) or will not estimate meaningful coefficients.
  1. I am pretty sure one measure, Audience Engagement, is estimated at the year level and has no between firm variance.  Yet in table 2, they claim that “Calendar year dummies and firm year dummies are included in all models.”  The year dummies should be perfectly correlated with Audience Engagement.  My guess is that STATA randomly kept AE and dropped one or more of the year dummies.
  2. They can’t have firm-year dummies for firm year data.  /**A correction has since been published by ASQ.**/
  3. I don’t think it is possible to include both calendar year and firm-year dummies in the same model.  Calendar year should be a linear function of firm year. 
  4. They include a constant (Firm age_1980, that is the age of the firm in 1980) in the analysis.  How can they include a constant in the analysis?  If as they say they include dummies for each firm it will simply remove one of the year dummies.  I guess STATA will run it and throw out the unusable variables, but I am not sure how to interpret it. 
    1.        “Because large and old firms may face bureaucratic momentum, but also possess more capabilities to adopt biotechnology (Ahuja, Lampert, and Tandon, 2008), we controlled for annual firm size (revenues t-1, deflated by the consumer price index) and firm age in 1980, except for tests ofH3 and H4, which were tested with ‘‘xtreg, f(ixed) e(ffects)’’
Misleading and unjustified use of statistics.
  1.  They do a “Heckman” correction by running a Z = BX + BI+ e, where Z is a continuous measure.  They then use the prediction of E(Z) in the analysis and Z.  Y = B*E(Z)+BZ+BX+e.   The interpret the coefficient on Z.  This is neither a Heckman nor an IV.  I have consulted multiple statisticians about this. They can find no justification for this technique.
    1.  “We controlled for endogeneity, or the possibility that narcissistic CEOs are attracted by or hired into specific situations and/or that narcissistic tendencies emerge under certain conditions. Following the two-step approach of Heckman (1979), we first regressed several antecedent and contemporaneous variables against our measure of CEO narcissism. The antecedents (measured in the year prior to the CEO’s start), included firm revenues, firm age, calendar year, and ROE, as these factors might affect narcissistic tendencies. The contemporaneous variables (measured in the CEO’s second tenure year), included CEO duality, CEO age, and CEO origin (a dummy indicating if the CEO was an outside hire, defined as having arrived at the firm within two years prior to becoming CEO). Two of these variables, firm age and calendar year, were significantly associated with CEO narcissism (p < .01) in correlations and multivariate regression.6 As the second step in the Heckman correction, we created a predicted narcissism score for each CEO, based on the significant variables from step one, and included it as a control in the main analyses.”
Common bad practice.
  1. They do a “mediation” analysis by dropping an endogenous variable into the regression. 


I contacted both the authors and Jerry Davis (editor of ASQ).  The authors have admitted to the problems and submitted and amendment to ASQ about the problem created by the perfectly correlated dummy variables.  What follows is their proposal for fixing other problems.


**
Dear Andrew:

We’re following up on your email of 28 May.

You are right – the continuous audience engagement variable and year dummies should not have
been included in the same models, as they are perfectly collinear.

Here again, in order to estimate the effect of the annual value for audience engagement, Stata excluded two year dummies instead of just one, both of which then served as reference points. Fortunately, as you pointed out, Stata’s selection of the specific pair of reference years does not affect the significance of the audience engagement moderator, given that the interaction term is not multicollinear with the year dummies, so our hypothesized results are not affected. (For our own education and satisfaction, we have now conducted various experiments – dropping the linear audience engagement variable and one calendar year dummy (instead of dropping two dummies), as well as varying which pairs of year dummies are dropped – and the hypothesized interaction effects remain fully robust.

Our suggestion would be to modify our correction concerning this issue as follows:
In our analyses of Hypotheses 1 and 2, which are reported in models 1-4 in Table 2 (p. 275), and in the post-hoc analyses, which are summarized in the models 2-4 and 6- 8 in the Appendix (p. 289), we erred by simultaneously including an invariant firm age variable and firm dummies, as this caused perfect multicollinearity between these two sets of variables. (Stata deals with this problem by dropping two dummy variables, shifting the reference points for the fixed effects coefficients.) Given that firm dummies themselves encompass effects from firm age, we have re-run all respective regressions with the redundant firm age variable excluded (while still retaining the firm dummies). Results of our hypotheses tests remained identical (which is to be expected, as changing the reference points only influences the coefficients and significance levels of the dummy variables). Relatedly, we erred in including the continuous audience engagement variable, which was measured annually, and year dummies in the same models, as they are perfectly collinear. Here again, in order to estimate the effect of the annual value for audience engagement, Stata excluded two year dummies instead of just one, both of which then served as reference points. However, the selection of the specific pair of reference years does not affect the significance of the audience engagement moderator, given that the interaction term is not multicollinear with the year dummies, so our hypothesized results are not affected. Moreover, all results, including the hypothesized interaction effects, remain the same if we drop the linear audience engagement variable and one calendar year dummy (instead of dropping two dummies).

From your point for view, does that seem sufficient?

**

What do you think?  Is it sufficient?

No comments:

Post a Comment