Category: Election Data Analysis

Validating Ed Solomon’s Observations on CO 2020 data

Post author By Jonathan Lareau
Post date May 26, 2025
No Comments on Validating Ed Solomon’s Observations on CO 2020 data

Recently I was made aware of the work Ed Solomon had been doing with data from the 2020 Colorado Cast Vote Records (CVRs), and I’ve taken some time to replicate and validate some of his data observations. I don’t always agree with Ed, but I wanted to take some time and verify the facts of the matter for myself.

For background, CVRs are machine logs of the way the tabulators process the “cast” ballots. You can think of them as equivalent to your bank statement showing all of the recorded transactions for each ballot scanned. They are required to be producible by ballot electronic tabulation systems, and are used as part of official forensic audits and documentation. They do not have any personal information and simply operate on the content of individual ballots as they are processed.

There are 2 specific items that need to be validated here:

Odd statistics associated with statewide ballot measures in Arapahoe County as compared to other counties. Specifically, there were two statewide ballot measures (one dealing with taxes, and another on abortion) that one would expect to show a significant partisan split, and we in fact do see such a split in neighboring El Paso and Adams counties. However, the ballot measures do not show the partisan split in Arapahoe County.
- The difference is not just that the partisan split is muted or reduced, it is a night and day difference. In Arapahoe county there is almost no statistical difference between Trump and Biden voters on the ballot measures, but there is an obvious and clear difference on the same ballot measures in neighboring Adams and El Paso counties.
- Why is this important? It raises questions as to the veracity of the election counts, data handling practices, and the ability to use CVRs for their intended forensic purpose.
The fact that the Arapahoe County CVR data was changed on the official county website without any notification or explanation around Feb 2025. The internal composition of ballots was changed in the data and “scrambled” by Arapahoe county … with the new version of the CVR files no longer showing the inconsistency from #1.
- As CVRs are official records that are used for legal purposes such as audits etc., they should never be “quietly” changed or modified retroactively. A full and transparent explanation of the issues and steps made to remedy should accompany any updates for official documents such as CVRs.
- This change took place years after the CVR was originally produced, and after Ed Solomon had used this particular CVR as part of his supporting documentation in an election case (Thompson vs Secretary of State NV) in Nevada.
  - The county was fully aware of the use of these records in the Nevada case.
  - The county CVRs had already been used in a previous audit of the 2020 election, where ballots from a specific tabulator and batch were pulled and compared to the cast vote record for accuracy. (see here, and here)
- After Ed and Mark noticed the retroactive modifications and started asking questions, the County released a statement explaining that they were contacted by a researcher about potential issues with their CVR regarding “redaction” and privacy concerns.
  - However, the county statement gives an incompatible date for when the documents were “corrected”, according to the file timestamps and internet archive logs. The statement claims April 2nd 2025, whereas the contents of the uploaded file show Feb 20th 2025 as the file modification date.
  - The counties explanation does not comport with the observation of the scrambling of internal contents of official ballots. Its not just that the ordering of ballots was randomized to assuage privacy concerns, but that the actual records of votes cast were being swapped between ballots.
- At BEST this shows a woeful lack of transparency and procedural safeguards by the county.
- At WORST this has the appearance of being intentional tampering with official records.

I can independently replicate and validate both of these data observations. There does seem to be an issue with the ballot measures in the original Arapahoe County CVR data, and that data has been retroactively modified by the county in such a way as to scramble the information associated with votes cast.

Note that Ed Solomon, Draza Smith, Jeff O’Donnell (a.k.a. “The Lone Raccoon”), Mark Cook, MadLiberals, and others all provided data and pointers to the original documents and URLs in question, as well as their own analysis on the X.com platform.

Starting with item # 1:

Arapahoe County makes their CVR data available on their website (https://arapahoevotes.gov/transparency/). To obtain the new (modified) file click on the “Certification & Recounts” tab and scroll down to find the CVR link on the left hand side. Direct link is here: https://www.arapahoevotes.gov/file/2020-general-election-redacted-cast-vote-record

The original data is also still available on the Arapahoe County website, but one needs to do some creative sleuthing via the wayback machine and looking at the URL links in order to get to it, as was done by Mark Cook. (see: here and here)

The link to the original CVR file is: https://gis.arapahoegov.com/DL_Data/Redacted_CVR/Redacted_CVR_Export.zip

The original data from the 2020 CVR data had also been collected and collated by Jeff O’Donnell on his https://votedatabase.com (formerly ordros.com) site, which I archived and versioned in Sept of 2022. I can confirm that the original file matches the files in the votedatabase archive, as well as the current votedatabase site. I used additional files from votedatabase archive for neighboring Adams and El Paso counties as a source for the rest of this work, as I could not find corresponding links to CVR downloads on the Adams or El Paso county websites.

Ed’s original observation was that the two ballot measures that *should* be partisan split were not when looking at the original 2020 CVR Arapahoe County data. He used this observation as supporting evidence showing inconsistencies and irregularities in 2020 election data in an court case (Thompson vs Secretary of State NV) he was providing analysis for in Nevada. All of his analysis and the original file have therefore been previously submitted to the court.

Amendment B was a repeal of the “Gallagher Amendment” dealing with property tax rates and was expected to have a highly partisan split. Likewise Proposition 115 dealt with abortion and was also expected to have highly partisan split.

If we look at the plots of the ballots cast for these two ballot measures, but we condition them on if the person voted for Trump or Biden at the top of the ticket we do see in neighboring counties such as El Paso and Adams counties this partisan split, as shown below. Note the significant spread between the Biden/No (Yellow) & Trump/No (Purple), as well as between the Biden/Yes (Blue) & Trump/Yes (Red).

As can be seen in the plots above from Adams and El Paso counties, there is a significant partisan split in these two down-ballot races when conditioned on how the top of the ticket votes. However this seems to vanish when looking at Arapahoe County, with the Biden/No (Yellow) & Trump/No (Purple) and the Biden/Yes (Blue) & Trump/Yes (Red) stacking almost completely on top of one another.

It can be clearly seen in the plots that the partisan split that was present in the other counties results seems to have completely vanished in Arapahoe.

I will note that the partisan split seems to be missing from almost all down-ballot races that I looked at, not just these two, although these were the ones specifically called out by Ed. This is an important point that I will come back to in a minute.

… And now to item # 2:

Ed’s original observation was submitted as part of his case in Nevada. At one point he and Mark Cook attempted to make a live stream video showing how people could recreate the observations starting from the source documents on the Arapahoe County website, which is when he and Mark realized that the original CVR file on the county website had been quietly replaced with a new file that had its contents scrambled and the results no longer showed the observed pattern.

Note that a CVR file is a legally required forensic record. It is the equivalent of a bank transaction log, and should almost never have its contents manipulated. If an error is discovered, and a correction does need to be issued, then a new file with the corrections should be published along side the original with a clear and prominent explanation and notification of the change. In this case, however, the County simply replaced the link to the original file with the new file with no explanation and no notice.

It was only after this was discovered, and after Ed started making phone calls to the County and bringing up the issue with the Judge in his Nevada case, that Arapahoe County belabouredly published an admission that they had adjusted the file. Their excuse for the modification was that they were made aware of a mistake with their “redaction” of data in the original publication, and were worried about individual privacy.

The (new) altered file did have 16 specific ballots that had their down-ballot races zeroed out, and was missing the “CountingGroup” metadata column. However, the file didn’t just have a small number of ballots (16) down-ballot information omitted, the internal contents on ALL ballots were also completely scrambled, with the top-of-the-ticket entries for President and Senate and metadata columns being completely reordered from all of the down-ballot information. This split-scrambling also also “fixed” the observed issues with Amendment B and Proposition 115, as can be seen below, where there is now a distinct partisan split between the data trends.

This kicked off multiple efforts to reverse engineer the actual changes that were performed on the CVR data by Arapahoe county, by myself and multiple others. Jeff O’Donnell and MadLiberals on X made the observation of the split reordering. I was able to verify this and remove the split-shuffling, exposing the fact that there were 16 ballots that also had all of their down ballot information zeroed out. There were a total of 432 down ballot votes that were removed from 16 specific ballots, followed by ALL of the President and Senate votes for ALL ballots being scrambled in relation to their down-ballot races.

Back to the point I made before above … the “scrambling” of the new file does seem to have “fixed” the expected partisan nature of most of the down-ballot races, so it is not unreasonable to think that this was actually a “fix” for a processing error on the original CVR file. That being said, the original (assumed incorrect) CVR was used in an audit process of two down-ballot races (linked above). Why did they not catch this issue years earlier during the audit? And why did they make the change to files, under the pretense of privacy issues, without announcing and documenting the errors?

Conclusion:

I can verify the two main data issues documented by Ed Solomon on the Arapahoe County 2020 CVR data.

The original data file had significant issues with down-ballot races not showing the expected partisan splits.

Arapahoe county did “quietly” revise the data without explanation until it was discovered by Ed and Mark, and then when pushed, only acknowledged that there was an issue with redactions and data privacy concerns.

The fact that the modification did correct the expected partisan split for ALL down-ballot races lends some credence to the assertion that they were correcting an error/issue with the original CVR file, however it does not excuse the fact that they performed this correction without notice or explanation. It also does not explain how their 2020 audit was able to use the incorrect original CVR files and not catch any of these issues.

The CVR files are intended to be official forensic records. If they are subject to manipulation and “adjustments” without transparency then that brings into question the validity of those files as forensic devices in the first place.

Corrections:

(5/28/2025) Typo correction in that the original posting of this article had “April 2 2024” as the data the Arapahoe county statement said they changed the CVR. That was corrected to be “April 2 2025”.
(5/28/2025) I had the wrong reference for the associated case in NV. I had originally posted that the case was the “Gilbert” case. It is actually “Thompson vs State”. Links have been updated accordingly.
(5/28/2025) Minor spelling errors corrected.

Election Data Analysis technical

2025 VA June Republican Primary Election DAL File Metrics

Post author By Jonathan Lareau
Post date May 4, 2025
No Comments on 2025 VA June Republican Primary Election DAL File Metrics

Below you will find the current summary data and graphics from the 2025 VA June Republican Primary Election Daily Absentee List files. We pull the DAL file everyday and track the count of each specific ballot category in each daily file.

Note: Page may take a moment to load the graphics objects.

Linear Scale Plot:

Print Copy Download

Place your cursor over the series name in the legend at right to see the series highlighted in the graphic. Place your cursor over a specific data point to see that data points value.

Logarithmic Scale Plot:

Print Copy Download

The logarithmic plot is the same underlying data as the linear scale plot, except with a logarithmic y-scale in order to be able to compress the dynamic range and see the shape of all of the data curves in a single graphic. Place your cursor over the series name in the legend at right to see the series highlighted in the graphic. Place your cursor over a specific data point to see that data points value.

Summary Data Table:

Print CSV Copy

The underlying data for the graphics above is provided in the summary data table.

Additional Data:

Additional CSV datasets stratified by Locality, City, Congressional District, State House District, State Senate District, and Precinct are available here.

Data column descriptions:

“ISSUED” := Number of DAL file records where BALLOT_STATUS= “ISSUED”
“NOT_ISSUED” := Number of DAL file records where BALLOT_STATUS= “NOT ISSUED”
“PROVISIONAL” := Number of DAL file records where BALLOT_STATUS= “PROVISIONAL” and APP_STATUS=”APPROVED”
“DELETED” := Number of DAL file records where BALLOT_STATUS= “DELETED”
“MARKED” := Number of DAL file records where BALLOT_STATUS= “MARKED” and APP_STATUS=”APPROVED”
“ON_MACHINE” := Number of DAL file records where BALLOT_STATUS= “ON_MACHINE” and APP_STATUS=”APPROVED”
“PRE_PROCESSED” := Number of DAL file records where BALLOT_STATUS= “PRE-PROCESSED” and APP_STATUS=”APPROVED”
“FWAB” := Number of DAL file records where BALLOT_STATUS= “FWAB” and APP_STATUS=”APPROVED”
“MAIL_IN” := The sum of “MARKED” + “PRE_PROCESSED”
“COUNTABLE” := The sum of “PROVISIONAL” + “MARKED” + “PRE_PROCESSED” + “ON_MACHINE” + “FWAB”
“MILITARY” := Number of DAL file records where VOTER_TYPE= “MILITARY”
“OVERSEAS” := Number of DAL file records where VOTER_TYPE= “OVERSEAS”
“TEMPORARY” := Number of DAL file records where VOTER_TYPE= “TEMPORARY”
“MILITARY_COUNTABLE” := Number of DAL file records where VOTER_TYPE= “MILITARY” and where COUNTABLE is True
“OVERSEAS_COUNTABLE” := Number of DAL file records where VOTER_TYPE= “OVERSEAS” and where COUNTABLE is True
“TEMPORARY_COUNTABLE” := Number of DAL file records where VOTER_TYPE= “TEMPORARY” and where COUNTABLE is True

All data purchased by Electoral Process Education Corp. (EPEC) from the VA Dept of Elections (ELECT). All processing performed by EPEC.

If you like the work that EPEC is doing, please support us with a donation.

Election Data Analysis technical

2025 VA June Democratic Primary Election DAL File Metrics

Post author By Jonathan Lareau
Post date May 4, 2025
No Comments on 2025 VA June Democratic Primary Election DAL File Metrics

Below you will find the current summary data and graphics from the 2025 VA June Democratic Primary Election Daily Absentee List files. We pull the DAL file everyday and track the count of each specific ballot category in each daily file.

Note: Page may take a moment to load the graphics objects.

Linear Scale Plot:

Print Copy Download

Place your cursor over the series name in the legend at right to see the series highlighted in the graphic. Place your cursor over a specific data point to see that data points value.

Logarithmic Scale Plot:

Print Copy Download

Summary Data Table:

Print CSV Copy

The underlying data for the graphics above is provided in the summary data table.

Additional Data:

Additional CSV datasets stratified by Locality, City, Congressional District, State House District, State Senate District, and Precinct are available here.

Data column descriptions:

“ISSUED” := Number of DAL file records where BALLOT_STATUS= “ISSUED”
“NOT_ISSUED” := Number of DAL file records where BALLOT_STATUS= “NOT ISSUED”
“PROVISIONAL” := Number of DAL file records where BALLOT_STATUS= “PROVISIONAL” and APP_STATUS=”APPROVED”
“DELETED” := Number of DAL file records where BALLOT_STATUS= “DELETED”
“MARKED” := Number of DAL file records where BALLOT_STATUS= “MARKED” and APP_STATUS=”APPROVED”
“ON_MACHINE” := Number of DAL file records where BALLOT_STATUS= “ON_MACHINE” and APP_STATUS=”APPROVED”
“PRE_PROCESSED” := Number of DAL file records where BALLOT_STATUS= “PRE-PROCESSED” and APP_STATUS=”APPROVED”
“FWAB” := Number of DAL file records where BALLOT_STATUS= “FWAB” and APP_STATUS=”APPROVED”
“MAIL_IN” := The sum of “MARKED” + “PRE_PROCESSED”
“COUNTABLE” := The sum of “PROVISIONAL” + “MARKED” + “PRE_PROCESSED” + “ON_MACHINE” + “FWAB”
“MILITARY” := Number of DAL file records where VOTER_TYPE= “MILITARY”
“OVERSEAS” := Number of DAL file records where VOTER_TYPE= “OVERSEAS”
“TEMPORARY” := Number of DAL file records where VOTER_TYPE= “TEMPORARY”
“MILITARY_COUNTABLE” := Number of DAL file records where VOTER_TYPE= “MILITARY” and where COUNTABLE is True
“OVERSEAS_COUNTABLE” := Number of DAL file records where VOTER_TYPE= “OVERSEAS” and where COUNTABLE is True
“TEMPORARY_COUNTABLE” := Number of DAL file records where VOTER_TYPE= “TEMPORARY” and where COUNTABLE is True

All data purchased by Electoral Process Education Corp. (EPEC) from the VA Dept of Elections (ELECT). All processing performed by EPEC.

If you like the work that EPEC is doing, please support us with a donation.

Election Data Analysis Election Forensics Election Integrity technical

Non-citizen registrations with previous voting history in VA election data – update March 2025

Post author By Jonathan Lareau
Post date March 2, 2025
No Comments on Non-citizen registrations with previous voting history in VA election data – update March 2025

We have updated our previous analysis (see March, July, Sept, Oct, Nov and Dec posts) with the latest information from the VA Department of Elections data.

Abstract:

Using the data provided by the VA Department of Elections (ELECT), we have identified at least 4,400 unique registrations that were self-identified as “Declared Non-Citizen” and removed by ELECT from the voter rolls since May of 2023. Of those 4,400 removals there were 683 that also had corresponding records of recent ballots cast at some point in the official Voter History record that we could observe. There were 1,775 associated ballots cast identified since Feb of 2019. There were an additional 9 non-citizen registrations and ballots as per the Daily Absentee List (DAL) data, that were not contained in the Voter History data. The total number of identified non-citizen ballots cast is therefore 1,784 by 692 registrants when combining unique VHL and DAL identifications.

After our March 2024 post on this topic, we submitted all of the relevant information that we had at the time to the VA AG’s office. We have not heard any response or update on the matter since that time, besides this being considered an active investigation. We subsequently sent our July results as well to the same contact at the AG’s office, but have had no response.

The Arlington County VA Electoral Board undertook their own investigation into this matter after our previous results were posted, and they recently (as of Sept 10 2024) voted 3-0 to send the information to the AG’s office as well. The Arlington County Commonwealths Attorney also is reported to have an ongoing investigation into the matter. Similar efforts are underway in multiple other counties, including Loudoun and Fairfax counties, to name a few.

https://www.gazetteleader.com/arlington/news/investigation-launched-have-non-citizens-voted-in-arlington-9379534

https://www.gazetteleader.com/arlington/news/va-attorney-general-to-be-alerted-on-possible-non-citizen-voting-9504753

In October the League of Women Voters along with the US DOJ sued VA over this issue and attempted to get an injunction to place ~1,600 of these removed registrations back on the Voter roll. After two lower courts granted the injunction, it was stayed by the US Supreme Court. It is EPEC’s opinion that the State of VA is correctly applying the law in this matter, as I detailed in a X.com post on Oct 12.

Background:

The VA Department of Elections continuously tries to identify and remove invalid or out of date registration records from the voter rolls. One category used for removal is if a registrant has been determined to be a non-citizen. It is required by the VA Constitution that only citizens are allowed to vote in VA elections.

In elections by the people, the qualifications of voters shall be as follows: Each voter shall be a citizen of the United States, shall be eighteen years of age, shall fulfill the residence requirements set forth in this section, and shall be registered to vote pursuant to this article. …VA Constitution, Article II, Section 1. https://law.lis.virginia.gov/constitution/article2/section1/

Additionally, according to VA Code Section 24.2-1004, the act of knowingly casting a ballot by someone who is not eligible to vote is a Class 6 felony.

A. Any person who wrongfully deposits a ballot in the ballot container or casts a vote on any voting equipment, is guilty of a Class 1 misdemeanor.

B. Any person who intentionally (i) votes more than once in the same election, whether those votes are cast in Virginia or in Virginia and any other state or territory of the United States, (ii) procures, assists, or induces another to vote more than once in the same election, whether those votes are cast in Virginia or in Virginia and any other state or territory of the United States, (iii) votes knowing that he is not qualified to vote where and when the vote is to be given, or (iv) procures, assists, or induces another to vote knowing that such person is not qualified to vote where and when the vote is to be given is guilty of a Class 6 felony.https://law.lis.virginia.gov/vacode/title24.2/chapter10/section24.2-1004/

ELECT makes available for purchase by qualifying parties various different data sets, including the registered voter list (RVL) and the voter history list information file (VHL). Additionally, ELECT makes available a Monthly Update Service (MUS) subscription that is published at the beginning of each month and contains (almost) all of the Voter List changes and transactions for the previous period.

In the MUS data there is a “NVRAReasonCode” field that is associated with each transaction that gives the reason for the update or change in the voter record. This is in accordance with the disclosure and transparency requirements in the NVRA. One of the possible reason codes given for records that are removed is “Declared Non-Citizen.”

EPEC has been consistently purchasing and archiving all of these official records as part of our ongoing work to document and educate the public as to the ongoing operations of our elections. (If your interested in supporting this work, please head on over to our donation page, or to our give-send-go campaign to make a tax-deductible donation, as these data purchases are not cheap!)

EPEC looked at the number of records associated with unique voter identification numbers that had been identified for removal from the voter record due to non-citizenship status, per the entries in the MUS, and correlated those results with our accumulated voter history list information in order to determine how many non-citizen registrations had corresponding records of ballots cast in previous elections. We only considered those records that are currently in a non-active state as of the latest MUS transaction log, as some determinations of non-citizenship status in the historical MUS transaction log might have been due to error and subsequently corrected and reinstated to active status. That is, and we emphasize here that we are not considering those records that had a “Declared Non-Citizen” disqualification, but were then subsequently reinstated and reactivated by ELECT.

Note that while EPEC has periodically purchased full copies of the Voter History List for our archive, there is a known issue with the way ELECT handles removals from the voter record that can cause sampling issues depending on the time the VHL file is purchased, and records of legitimately cast ballots to not be present in the VHL: Namely, when ELECT removes a voter from the voter list, they also remove all instances of that voter ID from voter history information and other data files provided to qualified organizations. (IMO … thats a terrible way to manage the data, but that is the way it is done.) In light of that, EPEC also used its archived versions of the Daily Absentee List (DAL) for recent elections in order to attempt to find records of votes cast that might otherwise be missing from the VHL.

Results:

There were 4,400 unique voter records marked for removal with the reason of “Declared Non-Citizen” and not subsequently reinstated in the accumulated MUS record that EPEC began collecting in mid-2023. Of those 4,400 there were 683 that also had corresponding records of recent ballots cast at some point in the official Voter History record that we could observe. There were 1,775 associated ballots cast identified since Feb of 2019. Figure 1 shows the distribution of non-citizen voters in the cumulative MUS file history. The blue trace represent the total identified and CANCELED non-citizen registrations, and the yellow trace represents the number of those records that also had corresponding records in the accumulated voter history data.

Figure 1: Distribution if the number of identified non-citizen records and ballots in the cumulative ELECT MUS file history. The x-axis is the date that a record was marked as CANCELED for the reason of “Declared Non-Citizen”.

Note that the data contained in the MUS updates often covers more than a single month period. In other words, the individual MUS files are oversampled. Subsequent MUS files can therefore also have repeated entries from previous versions, as their data may overlap. Our analysis used the first unique entry for a given voter ID marked as “Declared Non-Citizen” in the cumulative MUS record, that had not been subsequently reinstated, in order to build Figure 1. This data oversampling in the MUS may help explain the small relative increase in the May 2023 bin compared to subsequent months.

As VHL information can be incomplete depending on the time the VHL data was purchased in relation to the time that registrants were removed from voter records, EPEC also checked these non-citizen removals against the archived history of Daily Absentee List (DAL) files that EPEC has accumulated. There were an additional 9 non-citizen registrations and ballots as per the Daily Absentee List (DAL) data that were not contained in the Voter History data. The total number of identified non-citizen ballots cast is therefore 1,784 by 692 registrants when combining unique VHL and DAL identifications.

These identifications represent only the individuals who declared themselves as non-citizen status through official interactions with ELECT, DMV, or other agencies. Each removed registrant was then contacted by the registrar to confirm their non-citizen status.

The distribution of identified unique voter ID’s for the 683 identified non-citizen voters per VA locality is given below in Table 1. It should be noted that each ballot record has a specific locality associated with where the ballot was cast, whereas unique individuals might move between localities over time. The assignment of unique identified individuals to each locality in table 1 is therefore based on the locality listed in the specific MUS “Declared Non-Citizen” record for that individual, while the assignment of ballot cast to Localities is based on the individual VHL/DAL records. A person could have lived and voted multiple times in one county, then moved to another county and voted again before finally being determined as a non-citizen. The same person would have generated multiple VHL/DAL records for each ballot cast, and associated with potentially different localities. This should be kept in mind when attempting to interpret Table 1.

LOCALITY_NAME	REMOVED	VOTED_VHL	VOTES_VHL	VOTED_DAL_NOT_IN_VHL
ACCOMACK COUNTY	7	0	0	0
ALBEMARLE COUNTY	58	13	39	0
ALEXANDRIA CITY	196	27	56	0
AMELIA COUNTY	3	2	8	0
AMHERST COUNTY	1	0	0	0
APPOMATTOX COUNTY	1	0	0	0
ARLINGTON COUNTY	107	14	39	0
AUGUSTA COUNTY	20	5	10	0
BEDFORD COUNTY	14	3	5	0
BOTETOURT COUNTY	3	0	0	0
BRISTOL CITY	1	0	0	0
BRUNSWICK COUNTY	3	2	5	0
BUCKINGHAM COUNTY	3	0	0	0
CAMPBELL COUNTY	0	0	0	0
CAROLINE COUNTY	11	3	10	0
CARROLL COUNTY	11	4	14	0
CHARLES CITY COUNTY	4	2	4	0
CHARLOTTE COUNTY	1	0	0	0
CHARLOTTESVILLE CITY	28	3	9	0
CHESAPEAKE CITY	104	24	58	0
CHESTERFIELD COUNTY	275	28	70	0
CLARKE COUNTY	8	3	4	0
COLONIAL HEIGHTS CITY	15	1	3	0
COVINGTON CITY	2	0	0	0
CRAIG COUNTY	1	0	0	0
CULPEPER COUNTY	32	4	12	0
CUMBERLAND COUNTY	1	0	0	0
DANVILLE CITY	25	4	12	0
DICKENSON COUNTY	1	0	0	0
DINWIDDIE COUNTY	9	1	1	0
EMPORIA CITY	2	0	0	0
ESSEX COUNTY	3	1	3	0
FAIRFAX CITY	9	3	13	0
FAIRFAX COUNTY	778	116	269	2
FALLS CHURCH CITY	1	1	2	0
FAUQUIER COUNTY	30	4	11	0
FLUVANNA COUNTY	2	1	1	0
FRANKLIN CITY	1	0	0	0
FRANKLIN COUNTY	2	1	4	0
FREDERICK COUNTY	33	4	3	0
FREDERICKSBURG CITY	25	2	4	0
GALAX CITY	2	0	0	0
GILES COUNTY	2	0	0	0
GLOUCESTER COUNTY	2	1	1	0
GOOCHLAND COUNTY	5	0	0	0
GRAYSON COUNTY	2	0	0	1
GREENE COUNTY	8	1	2	0
HALIFAX COUNTY	3	0	0	0
HAMPTON CITY	69	16	36	0
HANOVER COUNTY	17	2	8	0
HARRISONBURG CITY	100	5	6	0
HENRICO COUNTY	108	2	18	0
HENRY COUNTY	1	0	2	0
ISLE OF WIGHT COUNTY	1	0	2	0
JAMES CITY COUNTY	37	5	14	0
KING AND QUEEN COUNTY	1	1	4	0
KING GEORGE COUNTY	6	0	0	0
KING WILLIAM COUNTY	1	0	0	0
LANCASTER COUNTY	1	1	0	0
LOUDOUN COUNTY	268	58	132	0
LOUISA COUNTY	9	0	2	0
LUNENBURG COUNTY	1	0	0	0
LYNCHBURG CITY	26	5	9	0
MADISON COUNTY	0	0	0	0
MANASSAS CITY	56	5	13	0
MANASSAS PARK CITY	24	1	5	0
MARTINSVILLE CITY	7	1	0	0
MECKLENBURG COUNTY	10	4	13	0
MIDDLESEX COUNTY	2	0	0	0
MONTGOMERY COUNTY	9	3	15	1
NELSON COUNTY	2	0	0	0
NEW KENT COUNTY	4	1	0	0
NEWPORT NEWS CITY	125	26	65	0
NORFOLK CITY	113	18	54	0
NORTHAMPTON COUNTY	1	0	1	0
NORTHUMBERLAND COUNTY	2	1	1	0
NORTON CITY	1	0	0	0
NOTTOWAY COUNTY	4	0	0	0
ORANGE COUNTY	3	1	4	0
PATRICK COUNTY	3	1	2	0
PETERSBURG CITY	30	4	10	0
PITTSYLVANIA COUNTY	9	2	4	0
PORTSMOUTH CITY	50	14	44	0
POWHATAN COUNTY	4	0	1	0
PRINCE EDWARD COUNTY	11	3	11	0
PRINCE GEORGE COUNTY	17	0	0	0
PRINCE WILLIAM COUNTY	483	77	186	1
PULASKI COUNTY	8	1	2	0
RAPPAHANNOCK COUNTY	3	0	0	0
RICHMOND CITY	201	30	98	1
RICHMOND COUNTY	0	0	0	0
ROANOKE CITY	92	6	7	0
ROANOKE COUNTY	19	2	3	0
ROCKBRIDGE COUNTY	0	0	1	0
ROCKINGHAM COUNTY	21	5	13	0
RUSSELL COUNTY	3	1	1	0
SALEM CITY	4	1	1	0
SCOTT COUNTY	2	1	4	0
SHENANDOAH COUNTY	22	2	6	0
SMYTH COUNTY	2	0	0	0
SPOTSYLVANIA COUNTY	74	4	10	0
STAFFORD COUNTY	124	20	41	3
STAUNTON CITY	4	0	0	0
SUFFOLK CITY	44	16	44	0
SURRY COUNTY	1	0	0	0
SUSSEX COUNTY	2	1	3	0
TAZEWELL COUNTY	7	2	8	0
VIRGINIA BEACH CITY	199	34	133	0
WARREN COUNTY	18	3	5	0
WASHINGTON COUNTY	5	2	6	0
WAYNESBORO CITY	3	0	0	0
WESTMORELAND COUNTY	3	0	0	0
WILLIAMSBURG CITY	13	2	3	0
WINCHESTER CITY	25	2	2	0
WISE COUNTY	2	1	2	0
WYTHE COUNTY	3	0	0	0
YORK COUNTY	25	12	48	0
	4400	683	1775	9

The distribution of the 1,775 ballots that were identified as being cast by non-citizen voters (the yellow trace in Figure 1) in previous elections is shown in Figure 2. The most significant spikes are in the 2019, 2020, 2021 and 2022 November General elections, as well as the 2020 March Democratic presidential primary. Figure 3, which shows this distribution as a percentage of votes cast. Please note the scale of the Y-axis on the percent plot in Figure 3 is in percent of total ballots cast in each election. These graphs were only produced for the VHL data, and do not include the DAL identified records.

Figure 2: Distribution of identified non-citizen ballots cast in previous elections.

Figure 3: Distribution of identified non-citizen ballots cast in previous elections as percent of total ballots cast, according to entries in the VHL/DAL data files.

Figures 4 and 5 show the distribution of the registration dates of the identified non-citizen records. The same data is plotted in figure 4 and 5, with the only difference being the scale of the Y-axis in order to better observe the dynamic range of the values. When we look at the registration date of these identified records, we see that there is a distinct relative increase starting around 1996, and then again around 2012.

Figure 4: Registration dates of the identified non-citizen records. Absolute count on y-axis.

Figure 5: Registration dates of the identified non-citizen records. Logarithmic Y-axis scale.

EPEC made a FOIA request to the VA Attorney General’s office on March 11, 2024 inquiring for any records regarding how many prosecutions for non-citizen voting had occurred since June of 2023. We received a response that the AG had no such relevant records.

EPEC subsequently submitted our March analysis dataset to the VA AG’s office upon their request. We have heard no updates or status as to any action taken by the AG’s office since that time, except that it is being considered an ongoing investigation.

Discussion

It appears from the MUS data, that the VA Department of Elections (ELECT) is doing routine identification, cleanup and removal of non-citizen registrations, which is a good thing and we commend them for their continued efforts to maintain clean voter registration lists.

Given the current court cases filed by LWV and DOJ against the state of VA on the matter, it is important to note that the records identified in the MUS are only those resulting from individuals self-identifying as a non-citizen via interactions with DMV, ELECT, or other official avenues, and that the data presented here specifically excludes those individuals that were subsequently reinstated onto the voter rolls.

The fact that a small number of these identified non-citizen registrations are also associated with (presumably … if the data from ELECT is accurate) illegally cast ballots in previous elections does raise a number of questions that citizens should be (politely) asking and discussing with their legislators, elected and appointed government officials. Each act of non-citizen voting is a de-facto disenfranchisement of legal voters rights, and is a punishable offense under VA law.

Q: How did these registrants get placed onto the voter rolls in the first place?

Q: What method and/or data sources are used by the state to identify non-citizen registrations for removal? If that process is exhaustive, and covers all registrations, then these numbers might be considered to represent a statistical complete picture of the problem. If that process is not exhaustive, in that it only uses serendipitous corroborating data sources, then these results likely under-represent the scale of the issues.

Q: As noted above, we are only considering here those individuals who have not had their records re-instated or reactivated after a determination of non-citizen status. We do not have enough information to determine how or why some records were first determined to be non-citizen, canceled and then subsequently re-instated. One potential area of concern is determining whether or not registrants might be falsely or errantly claiming to not be a citizen on official documents in order to be excused from jury duty, for example, and then work to re-instate their voting status once those documents percolate through the system to ELECT and are flagged for removal. This is a wholly separate but serious issue, as making false claims on official documents is itself a punishable offense.

Q: What procedures, processes and technical solutions are in place to prevent current or future registration and casting of ballots by non-citizens? This is especially pertinent given the recent state of the flow of illegal immigrants crossing our national borders. According to a recent report by Yahoo Finance, VA is one of the top 30 destinations for illegal migrants, with both Loudoun County and Fairfax making the list.

Q: Why have none of the identified non-citizens who also cast ballots been investigated or prosecuted under VA Code 24.2-1004? As the identification of these ballots comes directly from looking at the official records produced by ELECT, it seems prudent for these to be forwarded by ELECT to the AG’s office with a recommendation to investigate and prosecute. Yet our FOIA request to the VA AG’s office inquiring as to any records associated with these types of investigations or prosecutions produced a “no relevant records exist” response. And since we submitted this information to the AG’s office, there has been no follow up.

Additionally, this evidence which is derived from only official state records, directly contradicts multiple news media reports and attestations that non-citizen voting is a “Myth”, and that non-citizen voting happens “almost never”. If the data from ELECT is accurate, then there are at least 1,775 ballots that have been cast by non-citizen voters just since 2019. Now, that is still very infrequent, but it is not “almost never.” It is a legitimate concern … and these discoveries are only the registrations that have been found and removed from the voter roles by ELECT and that we can observe in the data. We do not know how many exist that we do not know about.

It should be reiterated that these are only the records that we can observe given our data repository, and how often we can realistically purchase and acquire voter history and voter registration information. It is therefore likely that this represents a significant undercount of the occurrences of non-citizen voters and non-citizen voting.

It costs us (EPEC) approximately ~$5K for each purchase of the statewide voter history list, and approximately $15K/year to maintain RVL records using a single baseline full purchase + 2 purchases of the 6mo MUS subscription. Due to the infrequent nature of these data purchases, it is very likely that some individuals have had their voter history or voter registration information completely removed from the record in between our purchases. Additionally, we know that the MUS data does not entirely encompass all transactions performed on the RVL by the department of elections, so there may be yet other unknown transactions that we are missing.

For information that is supposed to be publicly available (according to federal NVRA laws), the state has put up significant hurdles in order for citizens and organizations to acquire it for use it for ensuring transparency and integrity of our electoral process. If we are to have elections that are transparent and accountable to the public, then we must insist that the data be made available and accessible.

Election Data Analysis Election Integrity

State of VA Elections – Feb 2025

Post author By Jonathan Lareau
Post date February 20, 2025
No Comments on State of VA Elections – Feb 2025

The following presentation is adapted from a presentation and discussion EPEC staff had with select VA legislators on 2025-02-18. Our team was very thankful for being invited to present and explain our methods, data and observations and look forward to further interactions with legislative teams.

Status of VA Election Systems Feb-2025 (1)Download

Corrections: On slide 5, bullets 1 and 4 had typos in the dates of the RVL files mentioned. The slides were corrected and re-uploaded on 2025-02-21 at 8:40pm.

Election Data Analysis Election Forensics Election Integrity technical

Non-citizen registrations with previous voting history in VA election data – update Dec 2024

Post author By Jonathan Lareau
Post date December 4, 2024
No Comments on Non-citizen registrations with previous voting history in VA election data – update Dec 2024

We have updated our previous analysis (see March, July, Sept, Oct and Nov posts) with the latest information from the VA Department of Elections data.

Abstract:

Using the data provided by the VA Department of Elections (ELECT), we have identified at least 3,765 unique registrations that were self-identified as “Declared Non-Citizen” and removed by ELECT from the voter rolls since May of 2023. Of those 3,765 removals there were 562 that also had corresponding records of recent ballots cast at some point in the official Voter History record that we could observe. There were 1,318 associated ballots cast identified since Feb of 2019. There were an additional 3 non-citizen registrations and ballots as per the Daily Absentee List (DAL) data, that were not contained in the Voter History data. The total number of identified non-citizen ballots cast is therefore 1,321 by 565 registrants when combining unique VHL and DAL identifications.

Note that there was a large spike (see Figure 1) in declared non-citizen removals in Sept that seems to have returned to the norm in October, and dropped to almost nothing in November. We do not receive enough information from VA ELECT in order to determine if this spike and drop is due to simply the number of interactions and interest in the 2024 general election, or changes in the procedures by ELECT, etc. But we can observe this change in the data, even though we do not have enough information as to make a hypothesis as to why it is present.

https://www.gazetteleader.com/arlington/news/investigation-launched-have-non-citizens-voted-in-arlington-9379534

https://www.gazetteleader.com/arlington/news/va-attorney-general-to-be-alerted-on-possible-non-citizen-voting-9504753

Background:

In elections by the people, the qualifications of voters shall be as follows: Each voter shall be a citizen of the United States, shall be eighteen years of age, shall fulfill the residence requirements set forth in this section, and shall be registered to vote pursuant to this article. …
VA Constitution, Article II, Section 1. https://law.lis.virginia.gov/constitution/article2/section1/

Additionally, according to VA Code Section 24.2-1004, the act of knowingly casting a ballot by someone who is not eligible to vote is a Class 6 felony.

A. Any person who wrongfully deposits a ballot in the ballot container or casts a vote on any voting equipment, is guilty of a Class 1 misdemeanor.

B. Any person who intentionally (i) votes more than once in the same election, whether those votes are cast in Virginia or in Virginia and any other state or territory of the United States, (ii) procures, assists, or induces another to vote more than once in the same election, whether those votes are cast in Virginia or in Virginia and any other state or territory of the United States, (iii) votes knowing that he is not qualified to vote where and when the vote is to be given, or (iv) procures, assists, or induces another to vote knowing that such person is not qualified to vote where and when the vote is to be given is guilty of a Class 6 felony.
https://law.lis.virginia.gov/vacode/title24.2/chapter10/section24.2-1004/

EPEC looked at the number of records associated with unique voter identification numbers that had been identified for removal from the voter record due to non-citizenship status, per the entries in the MUS, and correlated those results with our accumulated voter history list information in order to determine how many non-citizen registrations had corresponding records of ballots cast in previous elections. We only considered those records that are currently in a non-active state as of the latest MUS transaction log, as some determinations of non-citizenship status in the historical MUS transaction log might have been due to error and subsequently corrected and reinstated to active status. That is, we are not considering those records that had a “Declared Non-Citizen” disqualification, but were then subsequently reinstated and reactivated by ELECT.

While EPEC has periodically purchased full copies of the Voter History List for our archive, there is a known issue with the way ELECT handles removals from the voter record that can cause sampling issues depending on the time the VHL file is purchased, and records of legitimately cast ballots to not be present in the VHL: Namely, when ELECT removes a voter from the voter list, they also remove all instances of that voter ID from voter history information and other data files provided to qualified organizations. (IMO … thats a terrible way to manage the data, but that is the way it is done.) In light of that, EPEC also used its archived versions of the Daily Absentee List (DAL) for recent elections in order to attempt to find records of votes cast that might otherwise be missing from the VHL.

Results:

There were 3,765 unique voter records marked for removal with the reason of “Declared Non-Citizen” and not subsequently reinstated in the accumulated MUS record that EPEC began collecting in mid-2023. Of those 3,765 there were 562 that also had corresponding records of recent ballots cast at some point in the official Voter History record that we could observe. There were 1,318 associated ballots cast identified since Feb of 2019. Figure 1 shows the distribution of non-citizen voters in the cumulative MUS file history. The blue trace represent the total identified and CANCELED non-citizen registrations, and the yellow trace represents the number of those records that also had corresponding records in the accumulated voter history data.

Note that the data contained in the MUS updates often covers more than a single month period. In other words, the individual MUS files are oversampled. Subsequent MUS files can therefore also have repeated entries from previous versions, as their data may overlap. Our analysis used the first unique entry for a given voter ID marked as “Declared Non-Citizen” in the cumulative MUS record in order to build Figure 1. This data oversampling in the MUS helps explain the relative increase in the May 2023 bin.

As VHL information can be incomplete depending on the time the VHL data was purchased in relation to the time that registrants were removed from voter records, EPEC also checked these non-citizen removals against the archived history of Daily Absentee List (DAL) files that EPEC has accumulated. There were an additional 2 non-citizen registrations and ballots as per the Daily Absentee List (DAL) data, that were not contained in the Voter History data. The total number of identified non-citizen ballots cast is therefore 1,321 by 535 registrants when combining unique VHL and DAL identifications.

The distribution of identified unique voter ID’s for the 562 identified non-citizen voters per VA locality is given below in Table 1. It should be noted that each ballot record has a specific locality associated with where the ballot was cast, whereas unique individuals might move between localities over time. The assignment of unique identified individuals to each locality in table 1 is therefore based on the locality listed in the specific MUS “Declared Non-Citizen” record for that individual, while the assignment of ballot cast to Localities is based on the individual VHL/DAL records. A person could have lived and voted multiple times in one county, then moved to another county and voted again before finally being determined as a non-citizen. The same person would have generated multiple VHL/DAL records for each ballot cast, and associated with potentially different localities. This should be kept in mind when attempting to interpret Table 1.

LOCALITY_NAME	REMOVED	VOTED_VHL	VOTES_VHL	VOTED_DAL_NOT_IN_VHL	VOTES_DAL_NOT_IN_VHL
ACCOMACK COUNTY	7	0	0	0	0
ALBEMARLE COUNTY	45	8	13	0	0
ALEXANDRIA CITY	169	23	39	0	0
AMELIA COUNTY	2	1	3	0	0
AMHERST COUNTY	1	0	0	0	0
APPOMATTOX COUNTY	1	0	0	0	0
ARLINGTON COUNTY	107	14	39	0	0
AUGUSTA COUNTY	15	3	3	0	0
BEDFORD COUNTY	11	2	4	0	0
BOTETOURT COUNTY	2	0	0	0	0
BRISTOL CITY	1	0	0	0	0
BRUNSWICK COUNTY	2	1	2	0	0
BUCKINGHAM COUNTY	3	0	0	0	0
CAMPBELL COUNTY	0	0	0	0	0
CAROLINE COUNTY	9	3	10	0	0
CARROLL COUNTY	8	2	5	0	0
CHARLES CITY COUNTY	2	1	3	0	0
CHARLOTTESVILLE CITY	24	2	7	0	0
CHESAPEAKE CITY	95	22	51	0	0
CHESTERFIELD COUNTY	238	25	53	0	0
CLARKE COUNTY	8	3	4	0	0
COLONIAL HEIGHTS CITY	13	1	3	0	0
COVINGTON CITY	2	0	0	0	0
CRAIG COUNTY	1	0	0	0	0
CULPEPER COUNTY	27	2	2	0	0
DANVILLE CITY	19	3	9	0	0
DICKENSON COUNTY	1	0	0	0	0
DINWIDDIE COUNTY	9	1	1	0	0
EMPORIA CITY	2	0	0	0	0
ESSEX COUNTY	3	1	3	0	0
FAIRFAX CITY	9	3	13	0	0
FAIRFAX COUNTY	680	103	227	1	1
FALLS CHURCH CITY	0	0	0	0	0
FAUQUIER COUNTY	25	3	10	0	0
FLUVANNA COUNTY	2	1	1	0	0
FRANKLIN CITY	0	0	0	0	0
FRANKLIN COUNTY	2	1	1	0	0
FREDERICK COUNTY	30	2	2	0	0
FREDERICKSBURG CITY	23	2	4	0	0
GALAX CITY	2	0	0	0	0
GILES COUNTY	2	0	0	0	0
GLOUCESTER COUNTY	2	1	1	0	0
GOOCHLAND COUNTY	5	0	0	0	0
GRAYSON COUNTY	1	0	0	0	0
GREENE COUNTY	6	1	2	0	0
HALIFAX COUNTY	1	0	0	0	0
HAMPTON CITY	60	11	13	0	0
HANOVER COUNTY	14	2	6	0	0
HARRISONBURG CITY	84	5	6	0	0
HENRICO COUNTY	79	3	14	0	0
HENRY COUNTY	0	0	2	0	0
ISLE OF WIGHT COUNTY	1	0	2	0	0
JAMES CITY COUNTY	31	4	13	0	0
KING GEORGE COUNTY	5	0	0	0	0
KING WILLIAM COUNTY	1	0	0	0	0
LOUDOUN COUNTY	234	52	114	0	0
LOUISA COUNTY	8	0	0	0	0
LYNCHBURG CITY	23	3	4	0	0
MANASSAS CITY	50	4	11	0	0
MANASSAS PARK CITY	19	1	5	0	0
MARTINSVILLE CITY	6	1	0	0	0
MECKLENBURG COUNTY	7	3	10	0	0
MIDDLESEX COUNTY	2	0	0	0	0
MONTGOMERY COUNTY	1	0	3	0	0
NELSON COUNTY	2	0	0	0	0
NEW KENT COUNTY	3	1	0	0	0
NEWPORT NEWS CITY	111	23	48	0	0
NORFOLK CITY	95	12	32	0	0
NORTHAMPTON COUNTY	1	0	1	0	0
NORTHUMBERLAND COUNTY	3	2	5	0	0
NORTON CITY	1	0	0	0	0
NOTTOWAY COUNTY	4	0	0	0	0
ORANGE COUNTY	3	1	4	0	0
PATRICK COUNTY	1	0	0	0	0
PETERSBURG CITY	28	3	9	0	0
PITTSYLVANIA COUNTY	7	2	4	0	0
PORTSMOUTH CITY	44	13	38	0	0
POWHATAN COUNTY	3	0	1	0	0
PRINCE EDWARD COUNTY	10	3	11	0	0
PRINCE GEORGE COUNTY	12	0	0	0	0
PRINCE WILLIAM COUNTY	414	61	139	0	0
PULASKI COUNTY	6	1	2	0	0
RAPPAHANNOCK COUNTY	2	0	0	0	0
RICHMOND CITY	171	24	64	1	1
RICHMOND COUNTY	0	0	0	0	0
ROANOKE CITY	72	3	1	0	0
ROANOKE COUNTY	19	2	3	0	0
ROCKBRIDGE COUNTY	0	0	1	0	0
ROCKINGHAM COUNTY	21	5	13	0	0
RUSSELL COUNTY	3	1	1	0	0
SALEM CITY	3	0	0	0	0
SCOTT COUNTY	1	1	4	0	0
SHENANDOAH COUNTY	18	2	6	0	0
SMYTH COUNTY	2	0	0	0	0
SPOTSYLVANIA COUNTY	65	4	10	0	0
STAFFORD COUNTY	97	17	36	1	1
STAUNTON CITY	4	0	0	0	0
SUFFOLK CITY	40	15	38	0	0
SURRY COUNTY	1	0	0	0	0
SUSSEX COUNTY	2	1	3	0	0
TAZEWELL COUNTY	4	1	1	0	0
VIRGINIA BEACH CITY	165	21	69	0	0
WARREN COUNTY	14	2	5	0	0
WASHINGTON COUNTY	5	2	6	0	0
WAYNESBORO CITY	3	0	0	0	0
WESTMORELAND COUNTY	1	0	0	0	0
WILLIAMSBURG CITY	11	1	0	0	0
WINCHESTER CITY	24	2	2	0	0
WISE COUNTY	2	1	2	0	0
WYTHE COUNTY	3	0	0	0	0
YORK COUNTY	22	11	41	0	0
	3765	562	1318	3	3

The distribution of the 1,318 ballots that were identified as being cast by non-citizen voters (the yellow trace in Figure 1) in previous elections is shown in Figure 2. The most significant spikes are in the 2019, 2020, 2021 and 2022 November General elections, as well as the 2020 March Democratic presidential primary. Figure 3, which shows this distribution as a percentage of votes cast. Please note the scale of the Y-axis on the percent plot in Figure 3 is in percent of total ballots cast in each election. These graphs were only produced for the VHL data, and do not include the DAL identified records.

Discussion

Q: How did these registrants get placed onto the voter rolls in the first place?

Q: What procedures, processes and technical solutions are in place to prevent current or future registration and casting of ballots by non-citizens? This is especially pertinent given the current state of the flow of illegal immigrants crossing our national borders. According to a recent report by Yahoo Finance, VA is one of the top 30 destinations for illegal migrants, with both Loudoun County and Fairfax making the list.

Additionally, this evidence which is derived from only official state records, directly contradicts multiple news media reports and attestations that non-citizen voting is a “Myth”, and that non-citizen voting happens “almost never”. If the data from ELECT is accurate, then there are at least 1,318 ballots that have been cast by non-citizen voters just since 2019. Now, that is still very infrequent, but it is not “almost never.” It is a legitimate concern … and these discoveries are only the registrations that have been found and removed from the voter roles by ELECT and that we can observe in the data. We do not know how many exist that we do not know about.

Election Data Analysis Election Forensics Election Integrity mathematics technical

Differential Invalidation in VA 2024 General Election Data – Presidential Race

Post author By Jonathan Lareau
Post date November 18, 2024
No Comments on Differential Invalidation in VA 2024 General Election Data – Presidential Race

Using data published by the VA Department of Elections (“ELECT”), we plotted the Ballot Invalidation Rate (BIR) vs. the % of vote share for the winner in order to attempt to determine if “Differential Invalidation” of ballots occurred in the 2024 VA General Election. The plotted data appears to show differential invalidation and suggests that there are underlying issues that should be investigated and addressed, including data reliability and consistency issues where the number of reported total votes cast is greater than the number of reported ballots cast for some localities.

Details

“Differential invalidation” takes place when the ballots of one candidate or position are invalidated at a higher rate than for other candidates or positions. Note that differential invalidation does not directly indicate any sort of fraud. It is however indicative of an unfairness or inequality in the rate of incomplete or invalid ballots conditioned on candidate choice. While it could be caused by fraud or malfeasance, it could also be caused by confusing ballot layout, poor procedural controls and uniformity, under-voting (not choosing a candidate) by the voter, or other compounding factors, etc. (ref: [1] ch. 6)

The Free and Fair Hypothesis

In a democratic election, each persons vote counts the same. There are other requirements, but this is a necessary condition. In the presense of invalidation, the free and fair hypothesis reduces to each person’s vote having the same probability of being invalidated as any other persons ballot. From a statistical standpoint, this means that the invalidation must be independent of the candidate chosen on the ballot (or of the person voting) [ref: 1, pg. 132]

The data used for this analysis was the “unofficial” election results (the certified results are not yet published), and comes directly from the VA Dept of Elections. The data was downloaded on Nov 18th at 4:34 pm. We purposefully waited to perform this analysis until after the localities had completed their canvass operations, and for the data feeds on the VA Department of Elections (“ELECT”) website to mostly stabilize. The actual certified results will not be available until at least Dec 2 after the State Electoral Board meets to finalize the certification. We will revisit this analysis at that time.

The list of the number of Active Voters, Inactive Voters and total Ballots cast in a given locality can be found here: https://enr.elections.virginia.gov/results/public/Virginia/elections/2024NovemberGeneral/voters

Figure 1: Table of voter registration statistics and number of ballots cast as appeared on the VA Dept of Elections Website on Nov 18 16:34:00 EST at https://enr.elections.virginia.gov/results/public/Virginia/elections/2024NovemberGeneral/voters

The Turnout statistics and Vote Tally reports CSV file reports can be downloaded from here: https://enr.elections.virginia.gov/results/public/Virginia/elections/2024NovemberGeneral/reports
- The direct link to the turnout report used for this computation is here: https://enr.elections.virginia.gov/cdn/results/d2c804ee-4ec2-46bb-91d7-5b41526eab03/Election%20Turnout_bb03ef46-1af5-4f14-86a8-ead1b9094036.csv
- The direct link for the vote tally results report is here: https://enr.elections.virginia.gov/cdn/results/d2c804ee-4ec2-46bb-91d7-5b41526eab03/Election%20Results_4b7d6963-0b3a-4b2f-b331-d9d14ca39bc0.csv

Figure 2: Listing of the link for report CSV files as appeared on the VA Dept of Elections Website on Nov 18 16:34:00 EST at https://enr.elections.virginia.gov/results/public/Virginia/elections/2024NovemberGeneral/reports. Note that additional CSV files for “Election Winners”, “Election Change Log Report”, “EnrAbsenteeRawCSV”, as well as a complete JSON listing under the “Media Export” link at the bottom of the page.

With this dataset in hand we can know how many ballots were cast, as well as how many votes were counted for each candidate in each race in each locality (at least as reported by the state). For a given race, we can then compute the number of incomplete or invalid ballots by subtracting the total number of votes recorded for that race in the locality from the total number of reported ballots cast.

In accordance with the techniques presented in [1] and [2], we computed the plots of the Invalidation Rate vs the Percent Vote Share for the Winner in an attempt to observe if there looks to be any evidence of Differential Invalidation ([1], ch 6). This is similar to the techniques presented in [2], which we have used previously to produce election fingerprint that plotted the 2D histograms of the vote share for the winner vs the turnout percentage. (The 2024 versions are coming, just not ready yet.)

Each dot in Figure 3 below is representing the ballots from a specific locality. The x axis is the percent vote share for the winner (Harris), and the y axis is the ballot invalidation rate, and is computed as 100 – 100 * Nvotes / Nballots.

Figure 3: Plot of the Invalidation rate vs the % of vote share for the winner in each locality in the 20204 VA General Election for President.

A few things are immediately apparent from the plot in Figure 3:

There is clearly a distinction in the invalidation rate between localities that had low vote share and high vote share for harris.
- The data for localities where Harris had low vote share do not have a large distribution of invalidation rates, whereas the high vote share localities do.
There are a number of localities that are reporting negative invalidation rates. How is this possible, you ask? Well there are a number of localities in the CSV data that have higher vote totals than the corresponding reported number of total ballots cast in the locality.

This implies that there is something significantly wrong in the data and reporting tools or procedures used by ELECT, as all of this data was pulled nearly simultaneously and therefore the data should be at least self-consistent. While we understand that this is still unofficial data and that new updates may occur over time, at any given point in time the data should at least be self-consistent.

Note that there are still a few localities that have not yet had their vote totals reflected in the CSV files from ELECT. Those localities were omitted from this analysis. The combined information from all of the data source files that was used to generate this plot is available below.

Combined Results and Turnout Stats Table Download

In conclusion there does appear to be some indications that differential invalidation occurred in the 2024 VA General Election for President. Due to data inconsistencies and the fact that this data is still officially “unofficial” it is hard to make any definitive conclusions, but these results are suggestive of the existence of multiple underlying issues that need to be examined, understood and/or resolved. We can definitively say, however, that this is yet another example of the data streams from ELECT lacking self-consistency, which is a big problem in and of itself.

References

[1] Forsberg, O.J. (2020). Understanding Elections through Statistics: Polling, Prediction, and Testing (1st ed.). Chapman and Hall/CRC. https://doi.org/10.1201/9781003019695
[2] Klimek, Peter & Yegorov, Yuri & Hanel, Rudolf & Thurner, Stefan. (2012). Statistical Detection of Systematic Election Irregularities. Proceedings of the National Academy of Sciences of the United States of America. 109. 16469-73. https://doi.org/10.1073/pnas.1210722109.

Election Data Analysis Election Forensics Election Integrity technical

2024 VA Election Night Reporting over Time – President and Vice-President

Post author By Jonathan Lareau
Post date November 14, 2024
No Comments on 2024 VA Election Night Reporting over Time – President and Vice-President

EPEC monitored and archived the election night reporting (“ENR”) files published by the VA Department of Elections (“ELECT”) for the 2024 General Election, using the “media export” link at the bottom of the page to periodically download a JSON file with the data. On this page you will find the plots of the Locality reported summary counts from the JSON files as they changed over time.

As we noted in our post discussing some of our observations on the reliability of the JSON data feed, there are a few “glitches” that can be notices in the data plots below. While we are unsure as to the cause of the “glitches” in the data, it does not appear that any of them affected the outcome of the election.

Election Data Analysis Election Forensics Election Integrity technical

2024 VA Election Night Reporting over Time – United States Senate

Post author By Jonathan Lareau
Post date November 14, 2024
No Comments on 2024 VA Election Night Reporting over Time – United States Senate

Election Data Analysis Election Forensics Election Integrity technical

Observed data reliability issues with VA 2024 election night reporting data

Post author By Jonathan Lareau
Post date November 9, 2024
No Comments on Observed data reliability issues with VA 2024 election night reporting data

Our team at EPEC monitored the Election Night Reporting (ENR) data feed published by the VA Department of Elections (ELECT) during the 2024 General election. While we have not finished collecting or examining the election results, we have observed a few issues already that are presented below. (Note there is a TON more data analysis to do … these are just preliminary observations.)

There were three specific issues that we have noticed so far in our analysis of the ENR from ELECT. None of these issues look to have impacted the count, as far as we can tell, but these are system or procedural issues that should be documented, addressed and corrected going forward. We will continue to update on these issues as we find out more.

1. There was a distinct jump, and then immediate reversion, in 11 localities at 12am on Nov 6 in the ENR data feed. Bland County, Cambell County, Carrol county, Cumberland County, Dinwiddie County, Floyd County, Franklin City, Norfolk City, Nottaway County, Orange County and Rappahannock County all show the same “glitch”. The image galleries below show this “glitch” in the data feed all occurring at midnight. We have not (yet) reached out directly to ELECT or any of these counties yet to inquire as to the reason. We will update the blog if we find out anything as to the cause of this “glitch”.

2. The second issue we noticed was that Orange County had the first report of their totals at 10:10pm Nov 5 and initially reported the election day count for Donald Trump (7,891) and Kamala Harris (3,852). Then the data feed immediately removed the election day count information and began reporting the just the Early Voting and Mailed Absentee numbers (5,593 Trump, 4,259 Harris), only to add the same Election Day totals back in to the data feed at 9:06am Nov6.

3. Another interesting thing our team observed is that there were significant issues with the reliability public feed provided by VA ELECT over the course of the vote counting. Multiple team members were monitoring the feed from different locations around the state, and therefore had different endpoint internet connection configurations, yet we all noticed that the feed would routinely produce incorrectly formatted JSON data that could not be parsed by standard tools.

The errors seemed random and did not have a specific repeatable pattern when we tried to look at the data being returned. Sometimes it was simply a missing bracket, or quotation mark. Other times it appeared to be missing or malformed sections of the data. Python, MATLAB, Tableau and other standard JSON parser libraries were unable to parse these errant data files.

This might be an issue with IT infrastructure of bandwidth issues at ELECT causing dropped data packets? Or possibly an error in the server-side systems that respond to GET requests for the data? We do know that within the last few years, ELECT has partnered with “Enhanced Voting” to supply the ENR data feed. We hope this feedback is useful to help ELECT improve their ability to supply the public with reliable Election Night Reporting, and are happy to work with ELECT to help identify and correct these issues.

Recent Posts

Recent Comments

Archives

Categories