Communities

Writing
Writing
Codidact Meta
Codidact Meta
The Great Outdoors
The Great Outdoors
Photography & Video
Photography & Video
Scientific Speculation
Scientific Speculation
Cooking
Cooking
Electrical Engineering
Electrical Engineering
Judaism
Judaism
Languages & Linguistics
Languages & Linguistics
Software Development
Software Development
Mathematics
Mathematics
Christianity
Christianity
Code Golf
Code Golf
Music
Music
Physics
Physics
Linux Systems
Linux Systems
Power Users
Power Users
Tabletop RPGs
Tabletop RPGs
Community Proposals
Community Proposals
tag:snake search within a tag
answers:0 unanswered questions
user:xxxx search by author id
score:0.5 posts with 0.5+ score
"snake oil" exact phrase
votes:4 posts with 4+ votes
created:<1w created < 1 week ago
post_type:xxxx type of post
Search help
Notifications
Mark all as read See all your notifications »
Meta

Character set conversion(?) failure during initial import from SE

+3
−0

It looks like something went wrong during the import, causing imported posts to get the wrong character encoding.

I fixed one at https://speculative-science.codidact.com/q/225476, but have come across others as well.

Can this somehow be fixed in bulk, ideally in place without having to re-do the import?

It would be a pain to have to go through everything and manually fix non-US-ASCII characters.

History
Why does this post require moderator attention?
You might want to add some details to your flag.
Why should this post be closed?

0 comment threads

1 answer

+2
−0

I've fixed the most obvious failures in SQL. Beyond that... the posts are now UTF8, so fixing the encoding retroactively is difficult if not impossible - if there are other common bugs please let me know and I'll run an automatic replacement, otherwise please edit them to correct.

History
Why does this post require moderator attention?
You might want to add some details to your flag.

0 comment threads

Sign up to answer this question »