Home freenode/#haskell: Logs Calendar

Logs: freenode/#haskell

←Prev  Next→
Page 1 .. 602 603 604 605 606 607 608 609 610 611 612 .. 5022
502,152 events total
2020-10-12 23:49:53 <Axman6> newhoggy: I'd be interested to hear if you find anything you could use in hw-dsv in there
2020-10-12 23:51:20 <newhoggy> Yep. I had a skim and would like to look into it in more detail later.
2020-10-12 23:51:23 m1000 parts (~Boss@2a02:8108:96bf:b068:9dab:a091:4eab:df71) ()
2020-10-12 23:51:45 <gentauro> .NET also have number separators like -> https://twitter.com/denicmarko/status/1314198060117884928
2020-10-12 23:51:54 <gentauro> will it ever come to Haskell for better readability?
2020-10-12 23:52:15 <Axman6> GHC already has that
2020-10-12 23:52:40 <gentauro> Axman6: `Use NumericUnderscores to allow underscores in integer literals` :o
2020-10-12 23:52:44 <gentauro> I didn't knew that
2020-10-12 23:52:48 <Axman6> yes
2020-10-12 23:53:21 justsomeguy joins (~justsomeg@unaffiliated/--/x-3805311)
2020-10-12 23:53:31 <Axman6> not sure if Ada came up with that, but it's one of the features of Ada I always liked. great for binary literals: 0b1101_0101_1100_1010
2020-10-12 23:53:32 <gentauro> I like it !!!
2020-10-12 23:53:56 <gentauro> Axman6: `great for binary literals: 0b1101_0101_1100_1010` exactly
2020-10-12 23:54:01 st8less joins (~st8less@2603:a060:11fd:0:9c66:9b18:c21:60c)
2020-10-12 23:54:08 <no-n> it's in perl and C# too
2020-10-12 23:54:10 × ahmr88 quits (~ahmr88@cpc85006-haye22-2-0-cust131.17-4.cable.virginm.net) (Remote host closed the connection)
2020-10-12 23:54:10 <gentauro> but also big numbers: 42_000_000_000
2020-10-12 23:54:34 <Axman6> except when you confuse people by doing 42_000_000_0000 =)
2020-10-12 23:54:39 <newhoggy> Axman6: One of the design choices I made was to load the CVS via a lazy ByteString. I suspect there is a performance penalty for doing that and the method is not easily parrallelisable. There is an option to `mmap` to increase parallelism but in this case there is the down side the process will clock up a lot of memory usage which may not be suitable for some kinds of jobs.
2020-10-12 23:55:28 <Axman6> do you know if the chunks of the lazy bytestring are mmapped?
2020-10-12 23:55:34 <hekkaidekapus> dminuoso: “We do not discuss overlapping instances, implicit parameters, superclasses, or functional dependencies, although our implementation deals with all of these.” — (Emphasis is mine) Hmm… The paper may not be what you want if the implementation has changed in recent years. I would love to continue the digging, alas bed is calling. See you later.
2020-10-12 23:55:51 <gentauro> 01:54 < Axman6> except when you confuse people by doing 42_000_000_0000 =)
2020-10-12 23:55:56 <newhoggy> The kinds of jobs I'm thinking of are those that run in typical memory constrained VMs in where virtual memory is usually turned off (like in AWS)
2020-10-12 23:56:04 <gentauro> people doing that should loose at least a finger :)
2020-10-12 23:56:12 <int-e> Yeah with {-# LANGUAGE BinaryLiterals, NumericUnderscores #-} you can have 0b1000_000 or maybe 0b_1000_0000
2020-10-12 23:56:28 <Axman6> sine it is mmapped the memory usage isn't real memory usage right? the OS can page them out if needed since it's loaded read only
2020-10-12 23:56:56 <justsomeguy> gentauro: Seems like an easy typo to make. Maybe it should be checked by the compiler.
2020-10-12 23:57:21 <newhoggy> Swap gets turned off a lot on AWS because of the higher latency to disk of most machine types.
2020-10-12 23:57:22 <int-e> (Hmm, the first one came out short :) )
2020-10-12 23:57:28 <Axman6> newhoggy: I would be surprised if you can't mmap a huge file read only even if there isn't enough ram
2020-10-12 23:58:45 <newhoggy> I haven't tested it, but you may be right in that an mmapped file will function as expected even if swap is disabled.
2020-10-12 23:59:31 × merijn quits (~merijn@83-160-49-249.ip.xs4all.nl) (Ping timeout: 260 seconds)
2020-10-13 00:00:01 × CrystalNice quits (~CrystalNi@s91904426.blix.com) ()
2020-10-13 00:00:02 <newhoggy> I think the danger is if a page you're not done with gets evicted because you're aggressively pulling in new pages and you are forced to pay to reload it. On AWS, that's likely to be more expensive.
2020-10-13 00:00:50 <newhoggy> The other reason streaming is prefferrably is the CSV files *usually* come from S3 in compressed form rather than from local disk.
2020-10-13 00:00:58 <newhoggy> preferrable*
2020-10-13 00:01:16 <ski> % 0x10_10.10_10
2020-10-13 00:01:16 <yahb> ski: 4112.062744140625
2020-10-13 00:01:30 <ski> % 0b10_10.10_10
2020-10-13 00:01:31 <yahb> ski: ; <interactive>:151:1: error:; * Could not deduce (Num (b0 -> c)); (maybe you haven't applied a function to enough arguments?); from the context: (Num (b -> c), Num (a -> b)); bound by the inferred type for `it':; forall b c a. (Num (b -> c), Num (a -> b)) => a -> c; at <interactive>:151:1-13; The type variable `b0' is ambiguous; * In the ambig
2020-10-13 00:02:05 × acarrico quits (~acarrico@dhcp-68-142-39-249.greenmountainaccess.net) (Ping timeout: 240 seconds)
2020-10-13 00:02:24 × jedws quits (~jedws@121.209.161.98) (Quit: My MacBook has gone to sleep. ZZZzzz…)
2020-10-13 00:03:31 <Axman6> newhoggy: sure, though my take on this is that if you _do_ have a single strict bytestring, you should be abble to process it efficiently
2020-10-13 00:03:38 <newhoggy> Most likely use case is outside the cloud. Either locally or custom hardware.
2020-10-13 00:03:57 <Axman6> I would be interested to see the benchmark comparing both
2020-10-13 00:04:21 <newhoggy> Yeah. `hw-dsv` does have a strict mode which uses `mmap`
2020-10-13 00:05:34 nbloomf joins (~nbloomf@2600:1700:83e0:1f40:6800:17f1:f8ae:fc06)
2020-10-13 00:05:35 <newhoggy> I think it ends up slower in general because it has to build a `cspoppy` index.
2020-10-13 00:05:36 <Axman6> great - I couldn't remember from looking at the code yesterday
2020-10-13 00:06:25 <newhoggy> I don't have a data parallel version of the index building code :D
2020-10-13 00:06:45 acarrico joins (~acarrico@dhcp-68-142-39-249.greenmountainaccess.net)
2020-10-13 00:07:03 <newhoggy> I wonder if that is possible?
2020-10-13 00:07:21 × nbloomf quits (~nbloomf@2600:1700:83e0:1f40:6800:17f1:f8ae:fc06) (Client Quit)
2020-10-13 00:07:33 <newhoggy> Definitely interested in what the article does.
2020-10-13 00:07:36 × conal quits (~conal@64.71.133.70) (Quit: Computer has gone to sleep.)
2020-10-13 00:09:45 ahab joins (483206cb@adsl-72-50-6-203.prtc.net)
2020-10-13 00:09:50 snakemas1 joins (~snakemast@213.100.206.23)
2020-10-13 00:09:59 conal joins (~conal@64.71.133.70)
2020-10-13 00:10:00 nbloomf joins (~nbloomf@2600:1700:83e0:1f40:842d:ee09:f1b5:9acd)
2020-10-13 00:11:27 ahab is now known as mgs
2020-10-13 00:11:40 da39a3ee5e6b4b0d joins (~textual@n11211935170.netvigator.com)
2020-10-13 00:12:36 sep2 joins (49dc4892@c-73-220-72-146.hsd1.ca.comcast.net)
2020-10-13 00:14:43 × snakemas1 quits (~snakemast@213.100.206.23) (Ping timeout: 260 seconds)
2020-10-13 00:15:52 × falafel quits (~falafel@2605:e000:1527:d491:1ccf:5c8d:7928:e9cc) (Ping timeout: 260 seconds)
2020-10-13 00:17:50 <sep2> https://dpaste.org/V12z I want to go through entire list and see if they are all proper. On union
2020-10-13 00:18:22 <sep2> From what you see here, how can I just get my list?
2020-10-13 00:21:39 chkno joins (~chkno@75-7-2-127.lightspeed.sntcca.sbcglobal.net)
2020-10-13 00:22:03 ccallahan1 joins (~ccallahan@185.204.1.185)
2020-10-13 00:22:33 jedws joins (~jedws@121.209.161.98)
2020-10-13 00:23:54 × recon_- quits (~quassel@2602:febc:0:b6::6ca2) (Ping timeout: 244 seconds)
2020-10-13 00:24:49 × st8less quits (~st8less@2603:a060:11fd:0:9c66:9b18:c21:60c) (Ping timeout: 272 seconds)
2020-10-13 00:26:18 merijn joins (~merijn@83-160-49-249.ip.xs4all.nl)
2020-10-13 00:28:19 × Tario quits (~Tario@201.192.165.173) (Read error: Connection reset by peer)
2020-10-13 00:28:27 × quinn quits (~quinn@c-73-223-224-163.hsd1.ca.comcast.net) (Quit: ZNC 1.8.2 - https://znc.in)
2020-10-13 00:30:32 × ystael quits (~ystael@209.6.50.55) (Ping timeout: 256 seconds)
2020-10-13 00:30:35 × jedws quits (~jedws@121.209.161.98) (Quit: My MacBook has gone to sleep. ZZZzzz…)
2020-10-13 00:31:30 × merijn quits (~merijn@83-160-49-249.ip.xs4all.nl) (Ping timeout: 258 seconds)
2020-10-13 00:31:46 Tario joins (~Tario@201.192.165.173)
2020-10-13 00:32:21 × conal quits (~conal@64.71.133.70) (Quit: Computer has gone to sleep.)
2020-10-13 00:33:01 heatsink joins (~heatsink@107-136-5-69.lightspeed.sntcca.sbcglobal.net)
2020-10-13 00:33:50 jedws joins (~jedws@121.209.161.98)
2020-10-13 00:33:55 <lemmih_> sep2: That link is 404.
2020-10-13 00:34:27 quinn joins (~quinn@c-73-223-224-163.hsd1.ca.comcast.net)
2020-10-13 00:34:30 × chenshen quits (~chenshen@2620:10d:c090:400::5:f7f9) (Quit: My MacBook Pro has gone to sleep. ZZZzzz…)
2020-10-13 00:34:34 × bjobjo quits (~bjobjo@2a01:79c:cebf:d688::9e6) (Ping timeout: 244 seconds)
2020-10-13 00:34:57 × sep2 quits (49dc4892@c-73-220-72-146.hsd1.ca.comcast.net) (Remote host closed the connection)
2020-10-13 00:35:45 hyiltiz joins (~quassel@2601:14d:4100:112c:cff4:7114:bc84:3722)
2020-10-13 00:35:45 × hyiltiz quits (~quassel@2601:14d:4100:112c:cff4:7114:bc84:3722) (Changing host)
2020-10-13 00:35:45 hyiltiz joins (~quassel@unaffiliated/hyiltiz)
2020-10-13 00:35:59 × hyiltiz quits (~quassel@unaffiliated/hyiltiz) (Client Quit)
2020-10-13 00:37:19 × L29Ah quits (~L29Ah@unaffiliated/l29ah) (Ping timeout: 260 seconds)
2020-10-13 00:39:39 L29Ah joins (~L29Ah@unaffiliated/l29ah)
2020-10-13 00:39:55 ensyde joins (~ensyde@2600:1702:2e30:1a40:f09c:a012:516e:b253)
2020-10-13 00:41:27 bjobjo joins (~bjobjo@2a01:79c:cebf:d688::9e6)
2020-10-13 00:41:32 × mgs quits (483206cb@adsl-72-50-6-203.prtc.net) (Remote host closed the connection)
2020-10-13 00:42:13 × nados quits (~dan@107-190-41-58.cpe.teksavvy.com) (Read error: Connection reset by peer)
2020-10-13 00:42:47 nados joins (~dan@107-190-41-58.cpe.teksavvy.com)
2020-10-13 00:44:55 recon_- joins (~quassel@2602:febc:0:b6::6ca2)
2020-10-13 00:45:16 chenshen joins (~chenshen@2620:10d:c090:400::5:f7f9)
2020-10-13 00:46:12 thir joins (~thir@p200300f27f02580060eb7dde324e54c8.dip0.t-ipconnect.de)

All times are in UTC.