Issue 513683: email.Parser uses LF as line sep.

➜

This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

This issue has been migrated to GitHub: https://github.com/python/cpython/issues/36035

classification

Title:	email.Parser uses LF as line sep.
Type:		Stage:
Components:	Library (Lib)	Versions:	Python 2.2

process

Status:	closed	Resolution:	wont fix
Dependencies:		Superseder:
Assigned To:	barry	Nosy List:	barry, bthooper
Priority:	normal	Keywords:

Created on 2002-02-06 09:35 by bthooper, last changed 2022-04-10 16:04 by admin. This issue is now closed.

Messages (4)
msg9148 - (view)	Author: Brian Takashi Hooper (bthooper)	Date: 2002-02-06 09:35
I'm not sure what the best solution is for this, but some email clients sent multipart MIME messages using CRLF as the line separator instead of just LF, which seems to be assumed in email.Parser.Parser._parsebody. Maybe I'm reading the RFC wrong, but it seems like it says that lines of a mail message should be separated using CRLF (although I'm sure many clients don't do that either)...
msg9149 - (view)	Author: Barry A. Warsaw (barry) *	Date: 2002-02-06 12:32
Logged In: YES user_id=12800 My philosophy so far (and I think this is documented in the latest rev of the .tex file), is that the email package should deal with native line endings, and that it is the job of a delivering mta to convert from rfc line endings (crlf) to native. It is certainly the case that smtplib converts from native to rfc line endings when sending the message out. Most mtas (e.g. postfix) when piping the message to a process or onto a file will convert to native line endings, at least in my experience. This may not be a very useful assumption though, and it is probably more robust to be able to deal with either line endings. There have been some movements in this direction in the cvs snapshot of the mimelib/email package where support for multibyte charsets (e.g. Japanese) have been added. You might want to check out that project's cvs trunk and see if it helps your situation, or submit a bug report there and we'll prototype the fix in that project first. Eventually all that code will be ported back to the Python 2.3 tree.
msg9150 - (view)	Author: Brian Takashi Hooper (bthooper)	Date: 2002-02-06 14:41
Logged In: YES user_id=450505 OK, that seems like a satisfactory answer. I do actually happen to be using Postfix on FreeBSD, albeit a little old (maybe a year or so), and am piping mails to a Python script, which is where I observed this problem. Maybe something with my local setup? (I didn't set up Postfix, but I don't see why it wouldn't be doing the default thing) Maybe it would be safer not to make assumptions about the input message, and process line endings to native before parsing? This would be my vote anyways (I tend to avoid thoroughly reading documentation unless something doesn't work as I intuit it should :-)
msg9151 - (view)	Author: Barry A. Warsaw (barry) *	Date: 2002-03-15 16:51
Logged In: YES user_id=12800 Closing this as Won't Fix because the email package's philosophy is to process messages uses native line endings. Any conversions to/from RFC line endings must happen outside the package (e.g. by the delivering MTA, or by smtplib for outgoing mail).

History
Date	User	Action	Args
2022-04-10 16:04:57	admin	set	github: 36035
2002-02-06 09:35:21	bthooper	create