Corporate disclosure became more descriptive rather than quantitative over time. Thus, textual analysis gained popularity in finance and business, however, it requires massive computing power. The paper presents the panel set of the raw frequencies of positive and negative words across 90,463 Forms 10-K filed at Security Exchange Commission (SEC) in EDGAR (the Electronic Data Gathering, Analysis, and Retrieval system) over the period 1995-2008. The dataset consists of 456 variables. The texts of the forms were retrieved from the SEC servers and processed using text mining techniques. The data relevant for archive analysis on the sentiment of the financial statements and financial reporting on SEC registrants. Potential reuse for creation of the tone or sentiments indexes. Long-time data series allows for dynamic analysis. The data set allows reducing the computer power requirements for further research.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9011009PMC
http://dx.doi.org/10.1016/j.dib.2022.108110DOI Listing

Publication Analysis

Top Keywords

security exchange
8
exchange commission
8
positive negative
8
commission forms
4
forms k-10
4
k-10 filings
4
filings positive
4
negative word
4
word occurrence
4
occurrence dataset
4

Similar Publications

Background: Familial hypercholesterolemia (FH) is a hereditary dyslipidemia that confers a severely elevated risk for development of early atherosclerotic cardiovascular disease if left untreated. FH is underdiagnosed in most countries including Sweden.

Aim: To develop and evaluate the implementation of a digiphysical screening model to diagnose FH in the clinical routine.

View Article and Find Full Text PDF

Genomic sources from China are underrepresented in the population-specific reference database. We performed whole-genome sequencing or genome-wide genotyping on 1,207 individuals from four linguistically diverse groups (1,081 Sinitic, 56 Mongolic, 40 Turkic, and 30 Tibeto-Burman people) living in North China included in the 10K Chinese People Genomic Diversity Project (10K_CPGDP) to characterize the genetic architecture and adaptative history of ethnic groups in the Silk Road Region of China. We observed a population split between Northwest Chinese minorities (NWCMs) and Han Chinese since the Upper Paleolithic and later Neolithic genetic differentiation within NWCMs.

View Article and Find Full Text PDF

Changes in Antioxidant and Photosynthetic Capacity in Rice Under Different Substrates.

Biology (Basel)

January 2025

School of Tropical Agriculture and Forestry, Hainan University, Haikou 570100, China.

Against the backdrop of a changing global climate, the soil environment may undergo significant changes, directly affecting agricultural productivity and exacerbating global food security issues. Three different substrates were set up in this study, namely, S (high sand and low nutrient content), T (medium sand and medium nutrient content), and TT (low sand and high nutrient content). The results showed that the root/shoot ratio increased as the sand content increased (nutrient content decreased).

View Article and Find Full Text PDF

Population genetic structure of Phaedranassa cinerea Ravenna (Amaryllidaceae) and conservation implications.

BMC Plant Biol

January 2025

Centro de Investigación de La Biodiversidad y Cambio Climático (BioCamb), y Facultad de Ciencias de Medio Ambiente, Universidad Tecnológica Indoamérica, Machala y Sabanilla, Quito, Ecuador.

Background: Andean orography has shaped the endemism of plant species in montane forests, creating a mosaic of habitats in small and isolated areas. Understanding these endemic species' genetic diversity patterns is crucial for their conservation. Phaedranassa cinerea (Amaryllidaceae), a species restricted to the western Andes of Ecuador, is listed as "vulnerable" according to the IUCN criteria.

View Article and Find Full Text PDF

Genetic diversity is crucial to secure the survival and sustainability of ecosystems. Given anthropogenic pressure, as well as the projected alterations connected with the level and circulation of water, riparian forests are of particular concern. In this paper, we assessed the genetic variation of black poplar - one of the keystone tree species of riverine forests.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!