A Scale for the Assessment of Sexual Standards Among Youth: Psychometric Properties

The (hetero)sexual double standard (SDS), prescribing sexual modesty for girls and sexual prowess for boys, negatively affects sexual and mental health. Nevertheless, endorsement and enactment of the SDS is still common. In this study, we respond to recent calls for modernization in the field of sexual double standard research. We describe the development of the “Scale for the Assessment of Sexual Standards among Youth” (SASSY), as well as its psychometric properties. This instrument was designed to measure contemporary sexual double standard endorsement, defined as “the degree to which an individual’s attitude reflects a divergent set of expectations for boys and girls, in that boys are expected to be relatively more sexually active, assertive, and knowledgeable and girls are expected to be relatively more sexually reserved, passive, and inexperienced” among adolescents and emerging adults. In Study 1, a pool of 35 items was administered in a Dutch sample (N = 465, 54.8% female, age 16–20). A 20-item set formed a one-dimensional and internally consistent scale and was subsequently administered in a second Dutch sample. Study 2 (N = 818, 58.4% female, age 16–25) again assessed the 20-item set. After dropping one item, the 19-item SASSY proved to be one-dimensional and internally consistent, exhibiting good test–retest reliability, construct validity, and convergent validity. Finally, the instrument showed configural and metric measurement invariance across gender, age, education level, and sexual experience level, and configural, metric, and scalar measurement invariance across time. These studies confirmed the 19-item SASSY to be a reliable and valid new tool for the assessment of contemporary sexual double standard endorsement among adolescents and emerging adults.


Introduction
Reiss (1967) conducted the first large-scale study of attitudes toward “various degrees of sexual permissiveness embodied in our premarital standards” (p. 6), in which he noted that egalitarianism had not yet been achieved and that a sexual double standard (SDS) existed for men and women (Crawford & Popp, 2003). Many studies on the sexual double standard followed and the concept has been thoroughly reviewed over recent decades (Bordini & Sperb, 2013; Crawford & Popp, 2003; Eaton & Rose, 2011; Fugère, Escoto, Cousins, Riggs, & Haerich, 2008; Sanchez, Fetterolf, & Rudman, 2012). These reviews conclude that heterosexual romantic relationships seem to have become somewhat more egalitarian, but that the sexual double standard still exists, albeit in a different form. Whereas originally the central notion of the sexual double standard pertained to premarital courting and sexual behavior (Reiss, 1967), later definitions focused less on marital status, and included expectations in terms of sexual roles in line with the sexual double standard (e.g., prescribing divergent [re]active and [sub]assertive sexual roles to men and women) (Sanchez et al., 2012).
The sexual double standard has been related to a multitude of negative sexual and health outcomes, such as increased dating violence and sexual violence (Shen, Chiu, & Gao, 2012), poor sexual functioning among young women (Kiefer & Sanchez, 2007), higher STI/HIV infection risk (Bermúdez, Castro, Gude, & Buela-Casal, 2010), and decreased sexual and relationship satisfaction for both men and women (Sanchez, Crocker, & Boike, 2005). However, other studies in this field have produced mixed and contradictory results (Fugère et al., 2008; Marks & Fraley, 2005; Sanchez et al., 2012). Reviews have partially ascribed this to methodological issues (Sanchez et al., 2012), as well as to the use of outdated measures (Bordini & Sperb, 2013; Crawford & Popp, 2003; Fugère et al., 2008). It seems that the concept of the sexual double standard has evolved along with changes in the display of gendered behavior in dating and sexuality, but research methods have not been able to keep pace. This calls for the development of modernized methods and measures.
In this study, we respond to this call for modernization (Bordini & Sperb, 2013), by introducing a new measure to examine sexual double standard endorsement in its contemporary form. The new measure was designed based on a number of desired features. Firstly, we set out to develop an instrument that was suitable for capturing sexual double standard endorsement from the moment people begin to experience romantic and partnered sexual situations. Adolescence is a period when people start to explore sexual and romantic interactions (Collins, Welsh, & Furman, 2009), whereas emerging adulthood is a time when romantic interactions become more serious and are more likely, compared to adolescence, to include sexual intercourse (Arnett, 2000). We also know that there is already evidence of sexual double standards in first-time sexual interactions (Sanchez et al., 2012). At the same time, older measures mostly asked about abstinence before marriage, which does not translate well to today's reality for young people as marriage rates fall and different relationship forms, such as cohabitation, are on the rise. Therefore, the suitability of the new instrument specifically for assessment among both adolescents and emerging adults was a key factor in its design.
We therefore chose to reflect this multifaceted nature of the contemporary sexual double standard in the item pool of the new instrument (Study 1). In doing so, we broadly define the contemporary sexual double standard as “the degree to which an individual's attitude reflects a divergent set of expectations for boys and girls, in that boys are expected to be relatively more sexually active, assertive, and knowledgeable and girls are expected to be relatively more sexually reserved, passive, and inexperienced” (Emmerink, van den Eijnden, Ter Bogt, & Vanwesenbeeck, 2016). The instrument thus assesses an individual's attitude toward perceived social norms concerning sexuality for boys and girls. We wish to be mindful of deleting any of the themes established above entirely, through the deletion of items based on statistical arguments, although this cannot be completely prevented. We believe that the leading argument in this case should be that the multifaceted nature of the scale should not be compromised.
Thirdly, since the sexual double standard is a highly heteronormative phenomenon, the instrument was designed specifically for assessment in heterosexual samples. Although non-heterosexual populations are also bound to be affected by heteronormative gender norms (Szymanski & Henrichs-Beck, 2014), it is plausible that they are affected in a different way. It may be possible in the future to adapt the instrument for use in non-heterosexual samples. Fourthly, to enhance comparability of the results for young men and women, as well as to enable the instrument to be used in multiple study types, we designed the instrument so that it would be suitable for assessing both females and males. Fifthly and lastly, the original item pool was constructed in a manner matching our expectation that this instrument would measure a single construct, namely the sexual double standard. The study should show, however, whether this is indeed the case.
This article covers both instrument development (Study 1) and tests of psychometric properties (Study 1 and 2). The following research questions are addressed in the two studies:
1. Which subset of items for assessing sexual double standard endorsement among male and female adolescents and emerging adults forms the best one-factor and internally consistent scale? (Study 1).
2. Does the factor structure of the newly developed instrument established in the first study replicate in another sample, and what are the test-retest reliability, the construct validity, and convergent validity of this instrument? (Study 2).
3. Does the newly developed instrument show measurement (in)variance across time, gender, age, education level, and sexual experience level? (Study 2).

Method

Participants
The sample consisted of 512 adolescents and emerging adults (46.9% boys, 53.1% girls), aged between 16 and 20 years (M = 18.12, SD = 1.37). Participants completed a survey on “adolescent sexuality” and were assured of anonymity and the ability to end their participation at any time. Based on a sexual orientation question with a five-point response scale, participants were excluded if they indicated that they were attracted exclusively or mainly to members of their own sex, were attracted to both sexes equally, or were undecided as to their sexual orientation. Using this criterion, 47 participants were excluded from the analyses. The final sample consisted of 465 heterosexual adolescents (45.2% boys, 54.8% girls), which amounts to 90.8% of the original sample, aged between 16 and 20 years (M = 18.08, SD = 1.34). Sample characteristics are shown in Table 1.

Procedure
Ethical approval for the study was granted by the Child and Adolescent Studies department board at Utrecht University. An online panel enrolled by a commercial party was contracted to recruit community participants for our study. Participants were able to win prizes by participating, but received no financial reward. The aim was to obtain a sample that included often underrepresented groups (e.g., non-native Dutch and lower educated participants) in order to adequately reflect Dutch society. The method of data collection provided us with a sample of community adolescents and emerging adults aged between 16 and 20 years. The use of a panel made acquiring this sample more feasible. Moreover, using an Internet panel was of added value because our study involved rather personal questions and the Internet offers relative anonymity. This allowed participants to complete the questionnaire in the comfort and privacy of their own homes. Participants ticked a box stating that they understood that the questions would be of a sexual nature and that they wanted to continue to the questionnaire. They were further informed that they could cease their participation at any time.
No parental consent was needed, because the minimum age for completing the questionnaire was 16.

Measures
The proposed scale items were designed with older sexual double standard measures in mind (e.g., Traditional Sexual Attitudes [Kiefer & Sanchez, 2007]; Gender-Equitable Men Scale [Pulerwitz & Barker, 2008]; Male Role Attitudes Scale [Pleck, Sonenstein, & Ku, 1994]; Double Standard Scale [Caron et al., 1993]; Sexual Double Standard Scale [Muehlenhard & Quackenbush, 1998]) as well as based on empirically and theoretically derived insights from the literature analysis described in the Introduction. We made sure to design items that would be suitable for assessment among heterosexual male and female adolescents and emerging adults (i.e., no difficult wording or too many items describing marriage). In total, we generated 35 items on which participants indicated their degree of agreement on a six-point Likert scale ranging from “1 = completely disagree” to “6 = completely agree.” The items consisted of statements reflecting perceived social norms concerning sexuality for boys and girls.
The study was conducted in the Dutch language. To facilitate readability for an international audience, English translations of the items are given in the “Original 35-Item Pool for the SASSY” section. The original Dutch item wording can be obtained from the corresponding author upon request.

Demographics
Gender and age. Participants indicated their biological sex (male or female) and age.
Education. Participants answered a question on their current occupation: studying or not studying. They also indicated the highest academic qualification they had attained. If participants' main occupation was studying, the type of education they were following was taken as their education level. If participants indicated that they were currently not studying, the highest-level qualification they had obtained was taken as their education level. Education level was categorized as lower (primary school and junior vocational training), intermediate (intermediate education and vocational training), and higher education (pre-university education and university).
Sexual experience. Participants answered the question, “How many people have you had sex with in your life?” on a five-point scale with response categories ranging from “1 = none” to “5 = more than 10.” A definition of sex was given: “By ‘sex,’ we mean everything from feeling each other naked or caressing each other, to intercourse (penetration of the vagina or anus by the penis).” The responses were then recoded into a binary variable for use in the analyses: no sexual experience (for participants who answered “none”) versus sexual experience (for participants who answered “one or more”).
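The two derivation rules described above (education level and the binary sexual experience variable) can be sketched in a few lines of Python. This is only an illustration: the function names and the category strings are assumptions that paraphrase the text, not the actual questionnaire codes, and only the endpoints of the five-point partner item are known from the description.

```python
# Hypothetical labels paraphrasing the text; the real questionnaire codes are not given.
EDUCATION_CATEGORY = {
    "primary school": "lower",
    "junior vocational training": "lower",
    "intermediate education": "intermediate",
    "vocational training": "intermediate",
    "pre-university education": "higher",
    "university": "higher",
}


def education_level(is_studying, current_education, highest_qualification):
    """Use the current type of education for students, else the highest qualification."""
    source = current_education if is_studying else highest_qualification
    return EDUCATION_CATEGORY[source]


def has_sexual_experience(response):
    """Recode the five-point partner item: 1 ("none") -> False, 2 to 5 -> True."""
    if response not in (1, 2, 3, 4, 5):
        raise ValueError(f"unexpected response code: {response!r}")
    return response != 1


print(education_level(True, "university", "pre-university education"))
print(education_level(False, None, "junior vocational training"))
print(has_sexual_experience(1), has_sexual_experience(3))
```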

Analytical Strategy
We assessed the factor structure and internal consistency of the 35-item pool to determine which subset of items formed the best one-factor and internally consistent, reliable scale for the assessment of sexual double standard endorsement. We employed an exploratory factor analysis for this purpose. There were no missing values; therefore, no missing data handling procedure was needed.

Factor Structure and Internal Consistency
First, the factor structure and reliability of the 35-item instrument we had constructed were assessed. The scale was subjected to an exploratory factor analysis using principal axis factoring with oblique rotation. The Kaiser-Meyer-Olkin value was .88, which is above the recommended cutoff value of .60 (Kaiser, 1970, 1974), and Bartlett's Test of Sphericity (Bartlett, 1954) was statistically significant, supporting factorability. Furthermore, upon inspection of the scree plot, a break could be seen after the first component extracted. As the aim was to construct a one-dimensional (single factor) measure, we excluded the 11 items that did not load above .40 on the first factor (see the “Original 35-Item Pool for the SASSY” section for a breakdown of which items were excluded in this step). After these necessary scale adjustments had been made, an analysis of internal consistency was conducted with the remaining 24 items, yielding a Cronbach's alpha of .80. However, analyses indicated that removing an additional four items would greatly increase internal consistency. This yielded a Cronbach's alpha of .90 for the 20-item instrument. In the last step, the factor analysis was repeated, confirming a single-factor solution which explained 34% of the variance. Factor loadings for the items represented in the 20-item instrument are shown in Table 2.
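The reliability step above, computing Cronbach's alpha and checking how it changes when an item is removed, can be illustrated with a minimal Python sketch. The data are invented toy responses, not study data, and the function names are assumptions made for this illustration; the analyses reported here were presumably run in a standard statistics package.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(rows[0])  # number of items
    # Sample variance of each item (column) across respondents.
    item_vars = [variance([row[j] for row in rows]) for j in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def alpha_if_item_deleted(rows):
    """Alpha recomputed with each item left out in turn (0-based item index)."""
    k = len(rows[0])
    return {
        j: cronbach_alpha([[v for i, v in enumerate(row) if i != j] for row in rows])
        for j in range(k)
    }


# Made-up responses on a 1-6 agreement scale: 5 respondents x 4 items.
# The last item runs against the others, so dropping it should raise alpha.
toy = [
    [1, 2, 1, 5],
    [2, 2, 3, 4],
    [4, 3, 4, 2],
    [5, 5, 4, 3],
    [6, 5, 6, 1],
]
print(round(cronbach_alpha(toy), 3))
print({j: round(a, 3) for j, a in alpha_if_item_deleted(toy).items()})
```

In the toy data, the item that runs against the rest depresses alpha, and removing it raises alpha sharply. This mirrors the logic of dropping four items to move from .80 to .90, although the actual items and values differ.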

Study 2
A subsequent study was conducted in two waves to examine whether the factor structure would replicate in a different sample using a slightly broader age group, and to examine the test-retest reliability, construct and convergent validity, and measurement invariance of instrument scores. Test-retest reliability was assessed by comparing scores across the two waves, which were eight weeks apart. Construct validity was addressed by assessing the relationship of participant scores on the new instrument to participant scores on the SDSS (Muehlenhard & Quackenbush, 1998). This scale was chosen because it is widely used for the assessment of sexual double standards (Bordini & Sperb, 2013). We expected there to be a strong positive relationship between the scale scores, because both scales have been designed to measure sexual double standard endorsement. However, we did not expect the relationship to near perfection, because the new instrument was additionally designed for a specific context, a specific target group, and to be more multifaceted compared to previous instruments. Lastly, convergent validity was established by assessing the relationship of participant scores on the new instrument to scores on gendered attitudes in another context, namely the family context. We expected to observe a positive weak to moderate relationship between the scale scores, because both scales have been designed to measure gendered attitudes.

Method

Participants
The original sample obtained at Wave 1 consisted of 873 adolescents and emerging adults. As in the first study, we excluded participants who indicated that they were attracted mainly or exclusively to members of their own sex, were attracted to both sexes equally, or were unsure of their sexual orientation (n = 55). The final sample used in the analyses consisted of 818 heterosexual adolescents and emerging adults at Wave 1 (this amounts to 93.7% of the original sample). In comparison with Wave 1, a further 202 participants were lost as they did not complete the Wave 2 questionnaire. This led to a final sample used in the analyses for Wave 2 of 616 heterosexual adolescents and emerging adults (this amounts to 70.6% of the original sample, and to 75.3% of the sample analyzed at Wave 1). A comparison between the participants who dropped out between Wave 1 and Wave 2 (N = 202) and participants who completed both waves (N = 616) showed no significant differences in gender, age or scores on the variable of interest (scores on the new instrument). Sample characteristics are shown in Table 3.
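The sample flow reported above can be checked with simple arithmetic. The short Python sketch below only reuses the counts given in this paragraph; the variable and function names are illustrative.

```python
def pct(part, whole):
    """Percentage of `whole` represented by `part`, rounded to one decimal."""
    return round(100 * part / whole, 1)


original_wave1 = 873   # participants before the sexual orientation exclusion
excluded = 55          # excluded on the sexual orientation criterion
dropped_out = 202      # analyzed at Wave 1 but did not complete Wave 2

analyzed_wave1 = original_wave1 - excluded
analyzed_wave2 = analyzed_wave1 - dropped_out

print(analyzed_wave1, pct(analyzed_wave1, original_wave1))
print(analyzed_wave2, pct(analyzed_wave2, original_wave1), pct(analyzed_wave2, analyzed_wave1))
```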

Procedure
This study was granted ethical approval by the Ethics Committee of the Faculty of Social and Behavioral Sciences at Utrecht University (Reference: FETC15-003). Data collection was outsourced to CentERdata, Institute for Data Collection and Research, which is attached to Tilburg University, the Netherlands, and was carried out using the LISS (Longitudinal Internet Studies for the Social Sciences) panel. The LISS panel is a representative sample of Dutch individuals who participate in monthly Internet surveys from the comfort of their own homes (in exchange for a small reward). The panel is based on a true probability sample of (approximately 5000) households drawn from the population register. Households without a computer and Internet access are provided these by LISS. A random selection of LISS panel members from those households was invited to participate in the study. The number of eligible candidates in each household varied, according to how many household members subscribe to the panel and whether they fit our age inclusion criterion. The specific sample included thus consisted of a unique draw from the participants in the LISS panel. More information about the panel can be found on their Web site (www.lissdata.nl). The method of data collection provided us with a sample of community adolescents and emerging adults aged between 16 and 25 years that was large enough for a solid validation process. The use of a panel made acquiring this sample more feasible as there was a known response rate within the panel and adequate oversampling could be provided. Moreover, using an Internet panel was of added value because our study involved rather personal questions and the Internet offers relative anonymity. Participants ticked a box stating that they understood that the questions would be of a sexual nature and that they wanted to continue to the questionnaire. The study was described to them as “a study on young people and sexuality.” They were further informed that they could cease their participation at any time. No parental consent was needed, because the minimum age for completing the questionnaire was 16. Data collection took place between May and July of 2014. To enable test-retest reliability to be examined, the same participants completed the questionnaire in two waves, the second wave taking place eight weeks after the first.

Measures
The revised instrument described in Study 1, now consisting of 20 items, was administered to participants in both Wave 1 and Wave 2. Participants indicated their degree of agreement on a six-point scale ranging from “1 = completely disagree” to “6 = completely agree.” See the “Original 35-Item Pool for the SASSY” section for English-language item wording.
Demographics. Gender, age, education level, and sexual experience were assessed in an identical manner to Study 1.

Construct Validity
The Sexual Double Standard Scale (Muehlenhard & Quackenbush, 1998) was included in the survey in order to examine construct validity. The scale contained 20 items on which participants could indicate their degree of agreement on a four-point scale ranging from “1 = completely disagree” to “4 = completely agree.” In our study, we obtained a Cronbach's alpha of .52 for this scale. An example item is: “It is just as important for a man to be a virgin when he marries as it is for a woman.”

Convergent Validity
Scores on the constructs used to assess convergent validity were taken from the longitudinal database of the LISS panel.
Complete data were available for 504 of the participants who had also completed the new instrument and SDSS questionnaires.
Family gender norms. Questions were derived from the European Values Study (EVS) (2016). Higher scores on this measure indicated more conservative family gender norms. The scale contained seven items, rated on a five-point scale ranging from “1 = completely disagree” to “5 = completely agree.” The items formed a reliable scale, with a Cronbach's alpha of .75 in this study. An example item is: “A child who is not yet attending school is likely to suffer if his or her mother has a job.”
Traditional values. Questions were derived from the European Social Survey (ESS) (2016). Higher scores on this measure indicated more conservative gender norms for child-rearing. The scale contained four items, rated on a five-point scale ranging from “1 = completely disagree” to “5 = completely agree.” The items formed a reliable scale, with a Cronbach's alpha of .72 in this study. An example item is: “Generally speaking, boys can be brought up more liberally than girls.”

Analytical Strategy
First, the factor structure and internal consistency of the new instrument were reassessed. We employed a confirmatory factor analysis for this purpose. Subsequently, analyses were performed to ascertain test-retest reliability, construct validity, and convergent validity. Lastly, measurement (in)variance was examined across time, gender, age, education, sexual experience level, and ethnicity using confirmatory factor analysis. There were no missing values; therefore, no missing data handling procedure was needed.

Factor Structure
The factor structure of the new instrument was reassessed using confirmatory factor analysis with principal axis factoring. The Kaiser-Meyer-Olkin value was .91 for both Wave 1 and Wave 2, which is above the recommended cutoff value of .60 (Kaiser, 1970, 1974), and Bartlett's Test of Sphericity (Bartlett, 1954) was statistically significant in both Wave 1 and Wave 2, supporting factorability. The analysis showed that all items, except one, loaded above .40 and sufficiently strongly on the first factor in both Wave 1 and Wave 2, supporting a one-factor solution. Item 2 (Girls like boys who take the lead in sex), however, loaded somewhat lower in both Wave 1 and Wave 2. Based on these factor loadings, we decided to drop Item 2. Subsequent analyses were, therefore, performed with the 19-item instrument. The single factor of the 19-item instrument explained 32% of the variance in Wave 1 and 34% of the variance in Wave 2. Factor loadings are shown in Table 2. This final set of items was named the Scale for the Assessment of Sexual Standards among Youth (SASSY). The final 19-item instrument can be found in the “Original 35-Item Pool for the SASSY” section. Mean scores on the separate SASSY items as a function of gender are shown in Table 4.

Internal Consistency
The reliability of the 19-item SASSY scale was assessed in both Wave 1 and Wave 2. Cronbach's alphas obtained were well above the cutoff point for a reliable scale: .89 in Wave 1 and .90 in Wave 2.

Test-Retest Reliability
The correlation between the Wave 1 and Wave 2 SASSY data was substantial and highly significant (see Table 5), and within-gender scores on the SASSY did not differ significantly between Wave 1 and Wave 2.
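A test-retest coefficient of this kind is typically a Pearson correlation between each participant's scale score at the two waves. The sketch below assumes that reading; the text does not name the coefficient. The scores are invented for illustration and are not the values in Table 5.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length score lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length with at least 2 values")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sums of cross-products and squared deviations from the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical mean scale scores for five participants at each wave.
wave1_scores = [2.1, 3.4, 1.8, 4.0, 2.9]
wave2_scores = [2.3, 3.1, 2.0, 3.8, 3.0]
print(round(pearson_r(wave1_scores, wave2_scores), 2))
```

The same function would serve for the construct and convergent validity correlations, with the second list replaced by scores on the comparison scale.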

Construct Validity
The correlation between the SASSY and SDSS (Wave 1) was large (see Table 5).

Convergent Validity
A small but significant, positive correlation was found between SASSY (in both Wave 1 and Wave 2) and Family Gender Norms (see Table 5), indicating that increased sexual double standard endorsement was related to less liberal family gender norms (toward women). A moderate significant positive correlation was found between SASSY (in both Wave 1 and Wave 2) and Traditional Values (see Table 5), indicating that increased sexual double standard endorsement was related to less liberal gender norms for child-rearing. Owing to the gendered nature of sexual double standards, results were additionally examined separately for gender. No differences in significance levels emerged, and there were only slight variations in correlation strength. Therefore, correlations aggregated over gender are shown.

Measurement Invariance
Lastly, measurement (in)variance was examined across time, gender, age, education, sexual experience level, and ethnicity using confirmatory factor analysis. We assessed configural invariance (requires that model fit is acceptable across groups), metric invariance (requires that factor loadings are invariant across groups), and scalar (or strong) invariance (requires that item intercepts are invariant across groups), as proposed by Steenkamp and Baumgartner (1998). As cited in Steenkamp and Baumgartner, measurement invariance refers to “whether or not, under different conditions of observing and studying phenomena, measurement operations yield measures of the same attribute.” In other words, the question is whether a scale assesses true differences between groups or whether differences result from systematic biases. We used the standard Root Mean Square Error of Approximation (RMSEA) cutoff of < .05, with PCLOSE non-significant, and CFI > .90 to examine goodness of fit. To examine nested model differences, χ² difference tests were conducted. If this test was non-significant, measurement invariance was presumed to be present. However, we additionally used the decrease in CFI between the nested models, because the χ² difference test is sensitive to sample size, whereas CFI is more robust (Milfont & Fischer, 2015). If the χ² difference test was significant, but the nested models differed by no more than .01 in CFI, measurement invariance was concluded to be present, regardless of the significant χ² difference test (Cheung & Rensvold, 2002). The fit of the factor model was good: χ²(131) = 449.518, RMSEA = .055 (PCLOSE = .077), and CFI = .932. All factor loadings were > .41. As shown in Table 6, the instrument showed configural and metric measurement invariance across gender, age, educational level, sexual experience level, and ethnicity, and configural, metric, and scalar measurement invariance across time.
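The decision rule described above can be written out as a small Python sketch. The first function uses a common textbook formula for the RMSEA point estimate; the second encodes the two-step rule (χ² difference test first, then the drop in CFI). The function names are hypothetical, the p-value is assumed to come from the SEM software, and using N = 818 (the Wave 1 sample) for the reported model is an assumption, since the text does not state which sample the fit statistics refer to. Some programs also divide by N rather than N - 1.

```python
from math import sqrt


def rmsea(chi2, df, n):
    """RMSEA point estimate: sqrt(max(chi2 - df, 0) / (df * (n - 1)))."""
    return sqrt(max(chi2 - df, 0.0) / (df * (n - 1)))


def invariance_supported(p_value, cfi_less_constrained, cfi_more_constrained,
                         alpha=0.05, max_cfi_drop=0.01):
    """Apply the rule from the text to one pair of nested models.

    Invariance is accepted if the chi-square difference test is non-significant,
    or if it is significant but CFI drops by no more than `max_cfi_drop`.
    """
    if p_value >= alpha:
        return True
    cfi_drop = cfi_less_constrained - cfi_more_constrained
    # Tiny tolerance so a drop of exactly .01 is not rejected by float rounding.
    return cfi_drop <= max_cfi_drop + 1e-12


# Reported one-factor model; N = 818 is an assumption (Wave 1 sample size).
print(round(rmsea(449.518, 131, 818), 3))

# Hypothetical nested-model comparisons (values invented for illustration).
print(invariance_supported(0.20, 0.932, 0.900))   # non-significant test
print(invariance_supported(0.001, 0.932, 0.927))  # significant, small CFI drop
print(invariance_supported(0.001, 0.932, 0.900))  # significant, large CFI drop
```

Under the stated assumption about N, the formula reproduces the reported RMSEA of .055 from the reported χ² and degrees of freedom.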

Discussion
As the concept of the sexual double standard has evolved, along with changes in the display of gendered behavior in dating and sexuality, and negative effects of sexual double standard  (Bordini & Sperb, 2013;Crawford & Popp, 2003;Fugère et al., 2008;Sanchez et al., 2012), the development of modernized methods and measures is warranted. In response to this call, this study proposed a new, multifaceted, and one-dimensional 19-item scale to measure sexual double standard endorsement.
The SASSY demonstrated excellent one-factor model fit, internal consistency, test-retest reliability, and convergent and construct validity, and showed configural and metric measurement invariance across gender, age, education level, and sexual experience level and configural, metric, and scalar measurement invariance across time. Overall, this speaks for the use of the scale in future studies.
Of course, there were also some limitations to our study. First off, we note that no scalar measurement invariance was found across gender, age, education level, and sexual experience level. Strictly speaking, this would mean that (since both configural and metric invariance do hold) assessing structural relationships across variables using the SASSY is advisable, but comparing group means is not. However, measurement invariance is often ignored in (validation) studies altogether, and when it is not, strict forms of invariance (such as scalar invariance) only rarely hold (Van De Schoot, Schmidt, De Beuckelaer, Lek, & Zondervan-Zwijnenburg, 2015). Therefore, we do not see a reason to be overly cautious in comparing group means.
Secondly, although we were mindful of deleting entire themes through the deletion of items based on statistical arguments, for the theme of “gender violations” all items had to be dropped, based on either the factor loadings or the subsequent reliability analysis. However, since this was the only one of the established themes that dropped out completely in the process of creating the final instrument, we do not think that the multifaceted nature of the scale was compromised. We additionally assessed whether there were any commonalities to be detected among the deleted items (for instance, a lower degree of variability, compared to the retained items, that they shared), but this did not appear to be the case.
The present study used self-report data. As the instrument was designed to assess the individual's attitude toward perceived social norms concerning sexuality for boys and girls, socially desirable responding can never be ruled out completely. We also note that using either a between- or within-subjects design in sexual double standard research may lead to different results, also regarding socially desirable responding. Both types of designs have previously been used in this field, generally assessing either individual double standards (mostly using within-subject designs) or perceptions of societal double standards (mostly using between-subject designs) (Crawford & Popp, 2003). Although it remains up for discussion whether the SASSY is more a measure of individual or societal double standards (or a combination), we believe it leans more toward the societal double standard. It could, therefore, be argued that it is an instrument that is most suitable for assessment in between-subject designs (Crawford & Popp, 2003). It is possible that, as a result of the gender comparison inherently present in the scale, participants may be somewhat more inclined toward socially desirable responding and report a similar standard for boys and girls. Previous research shows that this is, however, not necessarily the case, even when assessing the SDS in a between-subjects design (Sakaluk & Milhausen, 2012).
In the future, the instrument might be adapted for use in sexual minority samples as well. However, as the sexual double standard is a highly heteronormative phenomenon, it seemed best to focus first on heterosexual populations.
Funding No funding to report.

Compliance with Ethical Standards
Conflict of interest The authors declare that they have no conflict of interest.
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Appendix: Original 35-Item Pool for the SASSY
Below you will find a number of statements about boys and girls concerning sexuality. The statements refer to boys and girls, but they are also meant for young men and women. Please read the statements carefully and indicate whether or not you agree with each statement. We are only interested in your honest opinion; there are no right or wrong answers.
1. Once a boy is sexually aroused, a girl cannot really refuse sex anymore.
2. Girls like boys who take the lead in sex. c
3. I think that a girl who takes the initiative in sex is pushy.
4. Boys who jump into bed with anyone disgust me. a
5. I think it is more appropriate for a boy than for a girl to date different people at the same time.
6. Girls should act in a more reserved way concerning sex than boys.
7. I think it is more appropriate for a boy than for a girl to have sex without love.
8. A boy should be more knowledgeable about sex than a girl.
9. I think sex is less important for girls than for boys.
10. I think it is normal for boys to take the dominant role in sex.
11. I think sexually explicit talk is more acceptable for a boy than for a girl.
12. Sometimes a boy should apply some pressure to a girl to get what he wants sexually.
13. Girls who jump into bed with anyone disgust me. a
14. It is more important for a girl to keep her virginity until marriage than it is for a boy.
15. There are no longer any differences between men and women when it comes to sexuality. a
16. Romance is more important for girls than for boys. a
17. I think girls can have just as many one night stands as boys. a
18. Boys are more entitled to sexual pleasure than girls.
19. Boys like girls who take the lead in sex. a
20. I admire girls who behave in a masculine way. a
21. It is not becoming for a girl to have unusual sexual desires.
22. I admire a boy who is a virgin when he gets married. a
23. I admire boys who behave in a feminine way. b
24. Sex is more important for boys than for girls.
25. It is more important for a girl to look attractive than it is for a boy.
26. Boys and girls want completely different things in sex.
27. It's best for girls not to have sex with too many different boys, otherwise people will think badly of them. a
28. I think cheating is to be expected more from boys than from girls.
29. I think it's a good thing if boys let girls take the initiative for sex. a
30. Love is really more important than sex for boys. a
31. Sexually active girls create problems in relationships. b
32. I feel sorry for girls who are still virgins when they get married. b
33. I think it is important for a boy to act as if he is sexually active, even if it is not true.
34. Girls are more entitled to sexual pleasure than boys. b
35. I think it is more appropriate for a boy than for a girl to masturbate frequently.
Note. Italicized items are items that have been retained in the final version of the instrument. Original Dutch item wording can be obtained upon request.
a Indicates an item that was removed based on the factor analysis in Study 1.
b Indicates an item that was removed based on the subsequent reliability analysis in Study 1.
c Indicates an item that was removed based on the factor analysis in Study 2.