Logs: liberachat/#haskell
| 2021-06-11 13:35:21 | → | benin03 joins (~benin@183.82.205.218) |
| 2021-06-11 13:35:30 | × | benin03 quits (~benin@183.82.205.218) (Client Quit) |
| 2021-06-11 13:36:58 | × | feetwind quits (~mike@user/feetwind) (Ping timeout: 272 seconds) |
| 2021-06-11 13:37:01 | × | teaSlurper quits (~chris@81.96.113.213) (Ping timeout: 272 seconds) |
| 2021-06-11 13:37:22 | → | feetwind joins (~mike@user/feetwind) |
| 2021-06-11 13:37:24 | <tomsmeding> | Maxdamantus: if you're using the HTML page for submitting a paste, it's going to be utf8 because (according to MDN) the default encoding for a form submission is the encoding of the page -- which is utf8 here |
| 2021-06-11 13:37:55 | <tomsmeding> | so in principle, you can only see that encoding error from the server if you're submitting non-utf8 data in a manual curl request |
| 2021-06-11 13:38:06 | × | bontaq quits (~user@ool-18e47f8d.dyn.optonline.net) (Ping timeout: 264 seconds) |
| 2021-06-11 13:39:03 | <Maxdamantus> | I imagine that depends how the browser handles "UTF-8". |
| 2021-06-11 13:39:32 | <Maxdamantus> | A sensible handling of "UTF-8" in my opinion would involve preserving error bytes if you've copied it from some other source. |
| 2021-06-11 13:39:56 | <Maxdamantus> | I'm not sure if browsers do that, but I suspect if they don't it probably just has to do with alternative representations. |
| 2021-06-11 13:40:20 | <Maxdamantus> | (since browsers are still at least to some extent based around 16-bit strings) |
| 2021-06-11 13:40:35 | Maxdamantus | tries. |
| 2021-06-11 13:41:44 | × | nsilv quits (~nsilv@212.103.198.210) (Ping timeout: 264 seconds) |
| 2021-06-11 13:46:06 | → | argento joins (~argent0@168-227-96-53.ptr.westnet.com.ar) |
| 2021-06-11 13:47:38 | <Maxdamantus> | Hmm .. Firefox seems to do some funny mojibake when I try to paste UTF-8 data with some bad bytes in the middle. |
| 2021-06-11 13:48:16 | <Maxdamantus> | It reinterprets my text as Chinese characters when inserting them into a text input. |
| 2021-06-11 13:49:54 | → | lavaman joins (~lavaman@98.38.249.169) |
| 2021-06-11 13:49:57 | → | eggplantade joins (~Eggplanta@2600:1700:bef1:5e10:1d2f:92ad:f3ce:a171) |
| 2021-06-11 13:50:05 | <Maxdamantus> | Chrome at least doesn't assume a different encoding, but it does replace the errors with replacement characters. |
| 2021-06-11 13:51:06 | → | ikex joins (~ash@user/ikex) |
| 2021-06-11 13:51:20 | <Maxdamantus> | but again, I suspect one of the reasons for that is that those elements are based around 16-bit characters, so they will naturally just insert replacement characters when trying to do a UTF-8 to UTF-16 conversion. |
| 2021-06-11 13:51:30 | <Maxdamantus> | 16-bit strings* |
| 2021-06-11 13:51:59 | <Maxdamantus> | Might be different for a web browser that's not based on JavaScript. |
| 2021-06-11 13:52:54 | → | geekosaur joins (~geekosaur@069-135-003-034.biz.spectrum.com) |
| 2021-06-11 13:53:41 | × | trent1 quits (~trent@2001:8003:340d:d00:b2de:b98:7a93:b0ea) (Ping timeout: 252 seconds) |
| 2021-06-11 13:53:48 | → | nsilv joins (~nsilv@212.103.198.210) |
| 2021-06-11 13:54:04 | × | xff0x_ quits (~xff0x@217.110.198.158) (Ping timeout: 272 seconds) |
| 2021-06-11 13:54:14 | × | lavaman quits (~lavaman@98.38.249.169) (Ping timeout: 252 seconds) |
| 2021-06-11 13:54:27 | × | eggplantade quits (~Eggplanta@2600:1700:bef1:5e10:1d2f:92ad:f3ce:a171) (Ping timeout: 244 seconds) |
| 2021-06-11 13:54:58 | × | jespada quits (~jespada@90.254.242.55) (Ping timeout: 244 seconds) |
| 2021-06-11 13:55:35 | → | haskl joins (~haskeller@2601:643:897f:561d::af3) |
| 2021-06-11 13:55:49 | → | kluk joins (~kluk@2603-7000-9b3f-6934-8981-4e0b-0555-b388.res6.spectrum.com) |
| 2021-06-11 13:56:28 | → | jespada joins (~jespada@90.254.242.55) |
| 2021-06-11 14:00:25 | × | fryguybob quits (~fryguybob@cpe-74-65-31-113.rochester.res.rr.com) (Quit: leaving) |
| 2021-06-11 14:03:23 | × | kluk quits (~kluk@2603-7000-9b3f-6934-8981-4e0b-0555-b388.res6.spectrum.com) (Read error: Connection reset by peer) |
| 2021-06-11 14:04:45 | <Maxdamantus> | Heh. It works as I described using edbrowse. |
| 2021-06-11 14:04:51 | → | kluk joins (~kluk@2603-7000-9b3f-6934-8981-4e0b-0555-b388.res6.spectrum.com) |
| 2021-06-11 14:05:29 | × | orion quits (~orion@user/orion) (Ping timeout: 245 seconds) |
| 2021-06-11 14:06:38 | <Maxdamantus> | and Dillo. |
| 2021-06-11 14:08:08 | <Maxdamantus> | Both browsers have handling for UTF-8, but there isn't any real reason for them to corrupt the error bytes, since they're presumably just handling the UTF-8 bytes internally. |
| 2021-06-11 14:08:23 | <tomsmeding> | s/bytes/as bytes/ ? |
| 2021-06-11 14:08:31 | <tomsmeding> | that's what I would expect, and that's also what pastebin-haskell does |
| 2021-06-11 14:08:51 | <tomsmeding> | either you don't care about encoding and just process bytes, or you ensure an encoding and work with that |
| 2021-06-11 14:09:38 | × | dhil quits (~dhil@195.213.192.47) (Ping timeout: 252 seconds) |
| 2021-06-11 14:09:39 | <tomsmeding> | while there is some merit with trying to use an encoding but retaining error bytes in case they exist, I guess, it sounds like an approach that is guaranteed to give bugs for eternity |
| 2021-06-11 14:09:41 | <Maxdamantus> | Well, Dillo cares about the encoding, because the valid UTF-8 subsequences are rendered as the expected Unicode characters. |
| 2021-06-11 14:09:56 | → | Lycurgus joins (~juan@cpe-45-46-140-49.buffalo.res.rr.com) |
| 2021-06-11 14:09:56 | × | kluk quits (~kluk@2603-7000-9b3f-6934-8981-4e0b-0555-b388.res6.spectrum.com) (Read error: Connection reset by peer) |
| 2021-06-11 14:10:01 | → | eggplantade joins (~Eggplanta@2600:1700:bef1:5e10:1d2f:92ad:f3ce:a171) |
| 2021-06-11 14:10:10 | <tomsmeding> | or just very slow code if you have to re-check every part again every time you use the data because you can't actually assume any of your strings are valid in your encoding of choice |
| 2021-06-11 14:10:54 | <tomsmeding> | I wonder what Dillo would do if the page is in utf-16 |
| 2021-06-11 14:11:18 | <tomsmeding> | would it actually go convert to utf-16 or does it just do bytes internally up until it has to draw something on-screen |
| 2021-06-11 14:11:25 | <Maxdamantus> | I'm actually sceptical about what you said about the page encoding. |
| 2021-06-11 14:11:36 | <Maxdamantus> | I'll have to try that too. |
| 2021-06-11 14:13:53 | × | dunkeln quits (~dunkeln@94.129.65.28) (Quit: leaving) |
| 2021-06-11 14:14:51 | × | eggplantade quits (~Eggplanta@2600:1700:bef1:5e10:1d2f:92ad:f3ce:a171) (Ping timeout: 268 seconds) |
| 2021-06-11 14:15:32 | <Maxdamantus> | Hm, okay, it does seem to be true. |
| 2021-06-11 14:15:43 | <Maxdamantus> | but it's confusing, because I don't see anything in the request denoting the character set. |
| 2021-06-11 14:16:08 | <Maxdamantus> | Just has "Content-Type: application/x-www-form-urlencoded". |
| 2021-06-11 14:16:55 | × | curiousgay quits (~quassel@178.217.208.8) (Read error: Connection reset by peer) |
| 2021-06-11 14:17:21 | → | MoC joins (~moc@user/moc) |
| 2021-06-11 14:17:31 | × | anandprabhu quits (~anandprab@87.201.97.214) (Quit: Leaving) |
| 2021-06-11 14:17:50 | → | curiousgay joins (~quassel@178.217.208.8) |
| 2021-06-11 14:19:33 | → | jess joins (~jess@libera/staff/jess) |
| 2021-06-11 14:19:58 | <Maxdamantus> | Dillo doesn't seemo to support other encodings. |
| 2021-06-11 14:20:18 | <Maxdamantus> | Which is fair enough, since "utf-8" is the only valid encoding in HTML5. |
| 2021-06-11 14:20:48 | × | Crypt9a2e quits (~Crypt9a2e@157.51.3.28) (Ping timeout: 244 seconds) |
| 2021-06-11 14:22:22 | → | dhil joins (~dhil@80.208.56.181) |
| 2021-06-11 14:24:22 | → | nijopa joins (~nijopa@pool-98-118-125-240.bstnma.fios.verizon.net) |
| 2021-06-11 14:26:44 | × | azeem quits (~azeem@dynamic-adsl-94-34-34-125.clienti.tiscali.it) (Ping timeout: 245 seconds) |
| 2021-06-11 14:26:57 | → | azeem joins (~azeem@176.200.240.211) |
| 2021-06-11 14:28:19 | × | psydroid quits (~psydroidm@2001:470:69fc:105::165) (Changing host) |
| 2021-06-11 14:28:19 | → | psydroid joins (~psydroidm@user/psydroid) |
| 2021-06-11 14:30:20 | → | eggplantade joins (~Eggplanta@2600:1700:bef1:5e10:1d2f:92ad:f3ce:a171) |
| 2021-06-11 14:30:43 | × | Lycurgus quits (~juan@cpe-45-46-140-49.buffalo.res.rr.com) (Quit: Exeunt) |
| 2021-06-11 14:33:13 | × | hgolden quits (~hgolden2@cpe-172-114-84-61.socal.res.rr.com) (Quit: Konversation terminated!) |
| 2021-06-11 14:34:46 | → | wroathe joins (~wroathe@c-68-54-25-135.hsd1.mn.comcast.net) |
| 2021-06-11 14:34:56 | × | eggplantade quits (~Eggplanta@2600:1700:bef1:5e10:1d2f:92ad:f3ce:a171) (Ping timeout: 252 seconds) |
| 2021-06-11 14:35:37 | <wwalker> | lyxia: Thank you |
| 2021-06-11 14:36:50 | × | qbt quits (~edun@user/edun) (Quit: WeeChat 3.1) |
| 2021-06-11 14:37:10 | → | qbt joins (~edun@user/edun) |
| 2021-06-11 14:37:20 | → | Sgeo joins (~Sgeo@user/sgeo) |
| 2021-06-11 14:38:03 | → | hgolden joins (~hgolden2@cpe-172-114-84-61.socal.res.rr.com) |
| 2021-06-11 14:39:33 | × | hendursa1 quits (~weechat@user/hendursaga) (Quit: hendursa1) |
| 2021-06-11 14:40:20 | → | hendursaga joins (~weechat@user/hendursaga) |
| 2021-06-11 14:41:32 | → | teaSlurper joins (~chris@81.96.113.213) |
| 2021-06-11 14:45:54 | × | teaSlurper quits (~chris@81.96.113.213) (Ping timeout: 245 seconds) |
| 2021-06-11 14:50:06 | × | dhil quits (~dhil@80.208.56.181) (Ping timeout: 264 seconds) |
| 2021-06-11 14:50:35 | → | bontaq` joins (~user@ool-18e47f8d.dyn.optonline.net) |
| 2021-06-11 14:50:39 | → | eggplantade joins (~Eggplanta@2600:1700:bef1:5e10:1d2f:92ad:f3ce:a171) |
| 2021-06-11 14:51:49 | × | azeem quits (~azeem@176.200.240.211) (Read error: Connection reset by peer) |
| 2021-06-11 14:51:59 | → | altern joins (~Sergii@altern.corbina.com.ua) |
| 2021-06-11 14:52:27 | → | azeem joins (~azeem@dynamic-adsl-78-13-238-239.clienti.tiscali.it) |
| 2021-06-11 14:53:11 | × | altern quits (~Sergii@altern.corbina.com.ua) (Client Quit) |
| 2021-06-11 14:53:26 | → | Scotty_Trees2 joins (~Scotty_Tr@162-234-179-169.lightspeed.brhmal.sbcglobal.net) |
| 2021-06-11 14:54:54 | × | berberman quits (~berberman@user/berberman) (Ping timeout: 244 seconds) |
| 2021-06-11 14:54:56 | × | Scotty_Trees quits (~Scotty_Tr@162-234-179-169.lightspeed.brhmal.sbcglobal.net) (Ping timeout: 264 seconds) |
| 2021-06-11 14:55:02 | Scotty_Trees2 | is now known as Scotty_Trees |
| 2021-06-11 14:55:21 | → | berberman joins (~berberman@user/berberman) |
All times are in UTC.