Detect file encoding and convert it to UTF-8 without BOM #detecting file encoding
by Ashok - 6 years ago (2019-01-23)
I am unable to detect the encoding of a file that needs to be converted
| I am working on a project where I need to upload a CSV file to a database, but the file is in an unknown encoding, so I am unable to upload it.
I tried several approaches, such as detecting the encoding with mb_detect_encoding and converting the data to UTF-8, but without success.
Can anyone help me solve this problem? |
- 2 Clarification requests
2.
by Ray Paseur - 6 years ago (2019-01-31) Reply
This class (#11059) has not been approved yet, but it should help you.
https://www.phpclasses.org/package/11059-PHP-Validate-strings-in-UTF-8-encoding.html
1.
by Ray Paseur - 6 years ago (2019-01-28) Reply
Hi, Ashok. I can help with this. Please give me a test file, or better yet a link to a set of test files. UTF-8 is self-evident, and BOM is easy to remove. I wrote an article about this a few years ago.
My email is Ray.Paseur [at] Gmail if you want to send me a link to the test data. If you send me an email, I will send you links to my articles and presentation deck. Best regards, Ray
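The two points in this reply (UTF-8 is self-validating, and a BOM is easy to remove) can be sketched in a few lines of PHP. This is only an illustration, not code from Ray's article: the function names `is_valid_utf8` and `strip_utf8_bom` are hypothetical, and the validity check relies on the PCRE `u` modifier, which makes `preg_match` fail on malformed UTF-8.

```php
<?php
// Hypothetical helper names, used here for illustration only.

/** Return true when $bytes is well-formed UTF-8. */
function is_valid_utf8(string $bytes): bool
{
    // With the "u" modifier, preg_match() returns false (not 0 or 1)
    // when the subject is not valid UTF-8.
    return preg_match('//u', $bytes) === 1;
}

/** Remove a leading UTF-8 byte-order mark (bytes EF BB BF), if present. */
function strip_utf8_bom(string $bytes): string
{
    if (strncmp($bytes, "\xEF\xBB\xBF", 3) === 0) {
        return substr($bytes, 3);
    }
    return $bytes;
}
```

A file that passes `is_valid_utf8()` after `strip_utf8_bom()` is already "UTF-8 without BOM". Only files that fail the check need an encoding guess and a conversion.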
4 Recommendations
PHP Common Class Library: Set of classes that provides common functionality
This package provides a set of classes with common functionality.
The classes cover several kinds of functionality that are useful in many types of PHP applications. Currently it provides:
- A caching class that can store data in files, in a database using PDO, or in MemCached, APC, or Redis
- A complex string handling class
- Manipulate text strings that may be in different character set encodings
- Manipulate locale sensitive data types like strings, integers and fractions
- Parse configuration strings in YAML format
| by Caleb package author 30 - 5 years ago (2019-06-24) Comment
Check out my "Demojibakefier" class. Could be useful for what you need. :-) |
PHP Convert CSV to UTF-8: Convert a CSV file to have data in UTF-8 encoding
This class can convert a CSV file to have data in UTF-8 encoding.
It takes the name of a file with data in CSV format, detects the encoding of the text data that it contains and converts it to UTF-8 in case the data is not already in this encoding.
The resulting data can be stored in the same file or another file with a given name.
| by peyman package author 65 - 6 years ago (2019-02-03) Comment
hi. I write this class especially for your problem, please check this out and see if it will help your problem |
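The detect-then-convert flow described for this class can be sketched roughly as follows. This is not the package's actual API: the function name `convert_csv_to_utf8` and the default candidate list are assumptions, and the sketch requires the mbstring extension. It checks for a BOM first, then falls back to `mb_detect_encoding` in strict mode with an explicit candidate list. Single-byte encodings cannot be told apart reliably, so the candidate list should reflect where the files really come from.

```php
<?php
// Sketch only: the function name and candidate list are assumptions,
// not the interface of the "PHP Convert CSV to UTF-8" package.

function convert_csv_to_utf8(
    string $source,
    string $target,
    array $candidates = ['UTF-8', 'Windows-1252', 'ISO-8859-1']
): string {
    $data = file_get_contents($source);
    if ($data === false) {
        throw new RuntimeException("Cannot read $source");
    }

    // A byte-order mark is the most reliable hint, so check it first.
    // (UTF-32 is not handled in this sketch.)
    if (strncmp($data, "\xEF\xBB\xBF", 3) === 0) {
        $encoding = 'UTF-8';
        $data = substr($data, 3);
    } elseif (strncmp($data, "\xFF\xFE", 2) === 0) {
        $encoding = 'UTF-16LE';
        $data = substr($data, 2);
    } elseif (strncmp($data, "\xFE\xFF", 2) === 0) {
        $encoding = 'UTF-16BE';
        $data = substr($data, 2);
    } else {
        // Strict mode rejects candidates for which the data is invalid.
        // ISO-8859-1 maps every byte, so it effectively acts as a catch-all.
        $encoding = mb_detect_encoding($data, $candidates, true);
        if ($encoding === false) {
            throw new RuntimeException("Cannot determine encoding of $source");
        }
    }

    if ($encoding !== 'UTF-8') {
        $data = mb_convert_encoding($data, 'UTF-8', $encoding);
    }

    // Written without a BOM; $target may be the same path as $source.
    if (file_put_contents($target, $data) === false) {
        throw new RuntimeException("Cannot write $target");
    }
    return $encoding;
}
```

The return value is the encoding that was assumed for the input, which is worth logging. If the converted text looks wrong, the guess was probably wrong, and the candidate list needs adjusting.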
Class that outputs a table with the data from the result rows of a database query. It features:
- Database independency (works with any DBMS supported by Metabase).
- Splits the display of the result rows into multiple pages with a configurable number of rows, displaying automatic links to the Next, Previous, First, Last, etc. pages.
- Arbitrary column display.
- Automatic alignment of columns according to their data types.
- Configurable colors for the table headers and data alternating between even and odd rows.
| by Agro Biz 60 - 6 years ago (2019-02-02) Comment
010 01, Strážov, Žilina |
PHP UTF-8 Validation: Validate and repair strings in UTF-8 encoding
This class can validate and repair strings in UTF-8 encoding.
It takes a text string and checks if the characters are valid in UTF-8.
The class can also repair an invalid string by removing some invalid UTF-8 character sequences and Byte-Order Marks.
The class can return an object instance of itself with the string, byte length, character count, and the position of any encoding errors.
| by Ray Paseur package author 120 - 6 years ago (2019-01-31) Comment
See if this helps. If you use this class and have any difficulty with it, please reach out to me. Best of luck, Ray |
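To show the kind of report described above (byte length, character count, and positions of encoding errors), here is a minimal stand-alone sketch. It is not the code of the PHP UTF-8 Validation class: the function name `inspect_utf8` and the array keys are hypothetical. The "repair" shown simply drops the offending bytes, which is only one possible strategy.

```php
<?php
// Hypothetical function; it does not reproduce the package's real interface.

function inspect_utf8(string $s): array
{
    $len = strlen($s);
    $chars = 0;
    $errors = [];   // byte offsets of invalid bytes
    $clean = '';    // input with the invalid bytes dropped
    $minimum = [0, 0x80, 0x800, 0x10000]; // smallest code point per sequence length

    $i = 0;
    while ($i < $len) {
        $b = ord($s[$i]);
        if ($b < 0x80) {
            $need = 0; $cp = $b;                // ASCII
        } elseif ($b >= 0xC2 && $b <= 0xDF) {
            $need = 1; $cp = $b & 0x1F;         // 2-byte sequence
        } elseif (($b & 0xF0) === 0xE0) {
            $need = 2; $cp = $b & 0x0F;         // 3-byte sequence
        } elseif ($b >= 0xF0 && $b <= 0xF4) {
            $need = 3; $cp = $b & 0x07;         // 4-byte sequence
        } else {
            $errors[] = $i++;                   // stray continuation or illegal lead byte
            continue;
        }

        // All continuation bytes must exist and look like 10xxxxxx.
        $ok = $i + $need < $len;
        for ($k = 1; $ok && $k <= $need; $k++) {
            $c = ord($s[$i + $k]);
            if (($c & 0xC0) !== 0x80) {
                $ok = false;
                break;
            }
            $cp = ($cp << 6) | ($c & 0x3F);
        }

        // Reject overlong forms, UTF-16 surrogates, and values above U+10FFFF.
        if (!$ok || $cp < $minimum[$need]
            || ($cp >= 0xD800 && $cp <= 0xDFFF) || $cp > 0x10FFFF) {
            $errors[] = $i++;
            continue;
        }

        $clean .= substr($s, $i, $need + 1);
        $chars++;
        $i += $need + 1;
    }

    return [
        'byteLength'   => $len,
        'charCount'    => $chars,
        'errorOffsets' => $errors,
        'repaired'     => $clean,
    ];
}
```

For example, `inspect_utf8("a\xC3\xA9\xFFb")` reports 5 bytes, 3 characters, and one error at byte offset 3. Note that a leading BOM is itself valid UTF-8 and is counted as a character here, so it has to be stripped separately.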
- 1 Comment
1.
by Johnny Mast - 6 years ago (2019-04-01) Reply
Based on this request, I'm working on a package that can scan content (emails / files / websites) and detect non-BOM content, that is, content that could display unsupported characters. Developers will be able to add scanning mechanisms and let the engine detect non-BOM characters in email, CSV, files, databases, etc., and those adapters could fix the content. Its mechanism will be a bit like Flysystem for the filesystem.